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Dear Sir: 

The Applicants appeal the rejection of pending Claims 6-7, 9 and 11-17 in the above- 
captioned patent application. These claims were rejected in an Office Action dated October 4, 
2005. Applicants mailed a Notice of Appeal December 22, 2005. 
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I. REAL PARTY IN INTEREST 

Pursuant to 37 C.F.R. 41.37(c)(1), Appellants hereby notify the Board of Patent Appeals 
and Interferences that the real party in interest is the assignee of record for this application, 
Genentech, Inc., 1 DNA Way, South San Francisco, CA 94080. 



IL 



RELATED APPEALS AND INTERFERENCES 

Appellants are unaware of any other related appeals or interferences. 
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IIL STATUS OF THE CLAIMS 

The above-captioned application was filed with Claims 1-13. Appellants cancelled 
Claims 8 and 10 in an amendment mailed December 7, 2004. Appellants cancelled Claims 1-3 
and added new Claims 14-17 in a Submission filed with Request for Continued Examination 
dated June 29, 2005. The Examiner rejected Claims 4-7, 9 and 1 1-17 in an Office Action mailed 
October 4, 2005. Appellants filed an Notice of Appeal and "Amendment After Final" on 
December 22, 2005, canceling Claims 4 and 5 and amending Claim 12 to depend from Claim 6. 
The claims attached hereto as Appendix A reflect the claims as amended by the "Amendment 
After Final." 



IV. STATUS OF AMENDMENTS 

Appellants mailed a Notice of Appeal and "Amendment After Final" on December 22, 
2005 canceling Claims 4 and 5 and amending Claim 12 to depend from Claim 6, for purposes of 
appeal. 



V. SUMMARY OF THE CLAIMED SUBJECT MATTER 

The claimed subject matter relates to isolated polypeptides related to the polypeptide 
having SEQ ID NO: 10. Independent Claims 6 and 14 read: 

6. An isolated polypeptide comprising: 

(a) the amino acid sequence of the polypeptide having the amino acid 
sequence of amino acids 34-321 of SEQ ID NO: 10; 

(b) the amino acid sequence of the polypeptide having the amino acid 
sequence of amino acids 81-109 or 232-253 of SEQ ID NO: 10; or 

(c) the amino acid sequence of the polypeptide encoded by nucleotides 
100-966 of the cDNA deposited under ATCC accession number 209922. 

14. An isolated polypeptide having at least 95% amino acid sequence 
identity to: 

(a) the amino acid sequence of the polypeptide having the amino acid 
sequence of amino acids 34-321 of SEQ ID NO: 10; 

(b) the amino acid sequence of the polypeptide having the amino acid 
sequence of amino acids 81-109 or 232-253 of SEQ ID NO: 10; or 

(c) the amino acid sequence of the polypeptide encoded by nucleotides 
100-966 of the cDNA deposited under ATCC accession number 209922; 
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wherein said isolated polypeptide or a fragment thereof can be used to 
generate an antibody which can be used to specifically detect the 
polypeptide of SEQ ID NO: 10 in lung tissue samples. 

Various aspects of the claimed polypeptides are described in the specification at, for 
example, paragraphs [0012]-[0017], [0021], [0023], [0026], [0035]-[0036], [0198]-[0208], 
[0221], [0225], [0229], [0231], [0247], [0253]-[0271], [0280]-[0282], [0336], [0364], [0367], 
[0369], [0454]-[0499] and [0530], and Figure 10. SEQ ID NO: 10 is disclosed in the Sequence 
Listing appended to the application. 

VI. GROUNDS OF REJECTION TO BE REVIEWED ON APPEAL AND GROUPING 
OF CLAIMS 

A. Grounds of Rejection on Appeal 

The Examiner has rejected pending Claims 6-7, 9 and 11-17 under 35 U.S.C. §101, 
stating that the claimed invention is not supported by either a specific and substantial asserted 
utility or a well-established utility. Office Action at 2.^ 

The Examiner has rejected pending Claims 6-7, 9 and 11-17 under 35 U.S.C. §112, first 
paragraph as lacking an enabling disclosure, "since the claimed invention is not supported by 
either a specific and substantial utility or a well established utility." Id. at 20. In addition, 
pending claims 12-17 are rejected as failing to comply with the enablement requirement because 
"the scope of enablement provided to the skilled artisan by the disclosure is not commensurate 
vsdth the scope of the protection sought by the claims." Id. at 22-23. 

The Examiner has rejected pending Claims 12-17 under 35 U.S.C. § 1 12, first paragraph, 
as lacking an adequate written description, stating that the rejected claims contain "subject matter 
which was not described in the specification in such a way as to reasonably convey to one skilled 
in the relevant art that the inventor(s), at the time the application was filed, had possession of the 
claimed invention." Id, at 23. 



* Unless stated otherwise, citation herein to "Office Action" refers to the Office Action 
mailed October 4, 2005. 
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Finally, the Examiner has rejected pending Claims 6-7, 9 and 11-17 under 35 U.S.C. 
§112, first paragraph, as containing new matter. Id. at 25-27. 

B. Grouping of Claims 

1. Utility Rejection - Claims 6-7. 9 and 11-1 7 

For purposes of the utility rejection under 35 U.S.C. § 101, Claims 6-7, 9 and 1 1-17 can 
be considered as a group. 

2. Enablement Rejection - Claims 6-7, 9 and 11-17 

a. Group 1 - Claims 6-7, 9 and 11-13 

For purposes of the enablement rejection under 35 U.S.C. § 1 12, first paragraph, Claims 
6-7, 9 and 11-13 can be considered as a group. 

b. Group 2 - Claims 14 and 16-1 7 

For purposes of the enablement rejection under 35 U.S.C. § 112, first paragraph, Claims 

14 and 16-17 can be considered as a group. 

c. Group 3 - Claim 15 

For purposes of the enablement rejection under 35 U.S.C. § 1 12, first paragraph, Claim 

15 can be considered individually. 

3. Written Description Rejection - Claims 12-17 

a. Group 1 - Claims 12-13 

For purposes of the written description rejection under 35 U.S.C. § 1 12, first paragraph, 
Claims 12-13 can be considered as a group. 

b. Group 2 - Claims 14 and 16-1 7 

For purposes of the written description rejection under 35 U.S.C. § 1 12, first paragraph, 
Claims 14 and 16-17 can be considered as a group. 

c. Group 3 - Claim 15 

For purposes of the written description rejection under 35 U.S.C. § 1 12, first paragraph. 
Claim 15 can be considered individually. 
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4. New Matter Rejection - Claims 6-7, 9 and 11-1 7 

a. Group 1 - Claims 6. 12-1 7 

For purposes of the new matter rejection under 35 U.S.C. § 1 12, first paragraph, Claims 6 
and 12-17 can be considered as a group. 

b. Grow 2 - Claim 7 

For purposes of the new matter rejection under 35 U.S.C. § 1 12, first paragraph, Claim 7 
can be considered individually. 

c. Group 3 - Claim 9 

For purposes of the new matter rejection under 35 U.S.C. § 1 12, first paragraph, Claim 9 
can be considered individually. 

d. Group 4 - Claim 11 

For purposes of the new matter rejection under 35 U.S.C. § 1 12, first paragraph, Claim 1 1 
can be considered individually. 

VII. APPELLANTS^ ARGUMENT 

A, Summary of Arguments 

1. Utility Rejection 

The first issue before the Board is whether Appellants have asserted at least one "specific, 
substantial, and credible utility" for the claimed subject matter. See Examination Guidelines, 66 
Fed. Reg. 1092 (2001). Appellants have asserted that the claimed polypeptides related to the 
polypeptide of SEQ ID NO: 10 (the PR0874 polypeptide) are useful as diagnostic tools for 
cancer, particularly for lung cancer. This asserted utility is specific, substantial, and credible. 

Briefly stated, Appellants' asserted utility is based on the disclosure in Example 18 of the 
instant application that the mRNA encoding the PR0874 polypeptide is more highly expressed in 
normal lung tissue compared to lung tumor tissue. It is well-established that there is a reasonable 
correlation between changes in mRNA level for a particular gene and a corresponding change in 
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the level of expression of the encoded polypeptide, such that increasing or decreasing the amount 
of mRNA for a particular gene leads to a corresponding increase or decrease in the amount of the 
encoded protein. Thus, one of skill in the art would be more likely than not to believe that, like 
the PR0874 mRNA, the PR0874 protein is more highly expressed in normal lung tissue 
compared to lung tumor tissue. This differential expression of PR0874 polypeptide is useful for 
distinguishing lung tumor tissue from its normal tissue counterpart. Therefore, the claimed 
polypeptides related to the PR0874 polypeptide have a specific, substantial and credible utility as 
diagnostic tools for cancer, particularly lung cancer, as is explained in more detail below. 

2. Enablement Rejection 

The second issue before the Board is whether Appellants have enabled the pending claims 
such that one of skill in the art would be able to make and use the claimed invention. The 
Examiner has rejected pending Claims 6-7, 9 and 11-17 under 35 U.S.C. §112, first paragraph as 
lacking an enabling disclosure, because the claimed invention is not supported by either a 
specific and substantial utility or a well established utility. Office Action at 20. In addition, 
pending Claims 12-17 are rejected because "the scope of enablement provided to the skilled 
artisan by the disclosure is not commensurate with the scope of the protection sought by the 
claims." Id at 22-23. 

Appellants submit that Claims 6-7, 9 and 11-17 are enabled such that one of skill in the 
art could make and use the claimed polypeptides without undue experimentation. First, for the 
reasons discussed in detail below, the claimed invention is supported by a specific, substantial 
and credible utility. Accordingly, Appellants maintain that the specification enables one skilled 
in the art to make and use the claimed polypeptides. 

With respect to Claims 12-13, how to make chimeric polypeptides comprising the various 
polypeptides related to SEQ ID NO: 10 and the polypeptide encoded by the cDNA deposited 
under ATCC accession number 209922 is within the skill in the art. Similarly, with respect to 
Claims 14-17, it is well Mathin the skill of those in the art to make polypeptides that are at least 
95% identical to the recited sequences related to SEQ ID NO: 10 and the polypeptide encoded by 
the recited portions of sequence ATCC 209922, and it is well within the knowledge of those 
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skilled in the art how to make antibodies which are specific to a disclosed sequence. See In re 
Wands, 858 F.2d 731 (reversing the Board's decision of non-enablement and holding that as of 
1980, undue experimentation was not required to make high-affinity monoclonal antibodies to a 
target peptide). 

Appellants assert that the claimed polypeptides are useful as diagnostic tools for cancer, 
particularly lung cancer. This use is based on the disclosure in Example 18 of the instant 
application that the nucleic acid encoding the PR0874 polypeptide is differentially expressed at 
least two-fold in lung tumor compared to normal lung tissue. As detailed below, it is well- 
established that changes in the expression level of mRNA is correlated with a change in the 
expression level of the encoded polypeptide, and thus it is likely that the PR0874 polypeptide is 
also differentially expressed in lung tumors. Thus, based on the disclosure in the application, one 
of skill in the art would be able to use the claimed polypeptides as diagnostic tools to distinguish 
suspected lung tumor tissue from normal lung tissue without undue experimentation. 

3. Written Description Rejection 

The third issue before the Board is whether the claimed subject matter is described in the 
specification in such a way as to reasonably convey to one skilled in the art that the inventors had 
possession of the claimed invention at the time the application was filed. The Examiner has 
rejected pending Claims 12-17 under 35 U.S.C. §112, first paragraph, as lacking an adequate 
written description. Office Action at 23. 

The Examiner rejects claims 12-13 based on their dependency from Claim 4, arguing that 
the limitation "wherein said isolated polypeptide is more highly expressed..." is in adequate 
because "no information is provided in the differential analysis of PR0874 polynucleotide 
expression regarding the level of expression, activity, or role in cancer of the PR0874 
polypeptide." Id, at 23. In rejecting Claims 14-17, the Examiner argues that ''[bjecause the 
specification does not describe any biological activity of the claimed polypeptides and because 
the claims are not limited to any specific biological activity of the claimed polypeptides, the 
present claims are not analogous to example 14 of the written description guidelines." Id, at 24- 
25. 
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Appellants submit that they have satisfied the vsa*itten description requirement for rejected 
Claims 12-17. The Examiner's argument regarding Claims 12-13 is moot in light of the 
amendment of Claims 12-13 to depend from Claim 6, which the Examiner has not rejected as 
lacking an adequate written description. 

As for the rejection of Claims 14-17, Appellants submit that the claims are adequately 
described based on the actual reduction to practice of SEQ ID NO: 10, by specifying a high level 
of amino acid sequence identity, and by describing how to make and use antibodies to the 
disclosed sequence. Contrary to the Examiner's assertion, these facts are directly analogous to 
those of Example 14 of the Written Description Guidelines published by the PTO, since the 
claims are limited to polypeptides with the biological function of being "used to generate an 
antibody which can be used to specifically detect the polypeptide of SEQ ID NO: 10 in lung tissue 
samples." In addition, like the case In re Wallach where a single polypeptide sequence was 
sufficient written description for the vast number of nucleic acids which encode the polypeptide, 
the description of the single species SEQ ID NO: 10 is sufficient to place the Appellants' in 
possession of the claimed genus because those of skill in the art recognize the correlation 
between polypeptide structure and the ability to generate specific antibodies. For these reasons 
as detailed below, Appellants submit that Claims 12-17 are adequately described such that one of 
skill in the art would recognize that the inventors had possession of the claimed invention at the 
time the application was filed. 

4. New Matter Rejection 

The final issue before the Board is whether amendments to the claims are properly 
supported by the specification as filed. The Examiner has rejected pending Claims 6-7, 9 and 11- 
17 under 35 U.S.C. §112, first paragraph, as containing new matter. Office Action at 27. 

With respect to amino acid species starting at methionine residue #34, the Examiner 
argues that there is no "express, implicit, or inherent support for this species to the exclusion of 
all other species." Id. at 25. With respect to the limitations reciting residues 81-109 and IZl-lST^ 
of SEQ ID NO: 10, the Examiner argues that "the specification does not disclose a fragment of a 
PRO polypeptide that is the intracellular domain. The newly added limitations '81-109' and 
'232-253' imply that '81-109' and '232-253' are the extracellular domains." Id. at 26-27. 
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Appellants submit that paragraph [0196] states that "other methionine residues located 
either upstream or downstream from the amino acid position 1 in the figures may be employed as 
the starting amino acid residue for the PRO polypeptides." Specification at If [0196] (emphasis 
added). Figure 10 and SEQ ID NO: 10 show 8 methionine residues in the sequence, one of which 
begins with the methionine at residue #34. Combining the statement in paragraph [0196] with 
the disclosure of SEQ ID NO: 10, Appellants have conveyed with reasonable clarity to those 
skilled in the art that, as of the filing date sought, Appellants were in possession of polypeptides 
of SEQ ID NO: 10 beginning at any of the methionine residues listed in SEQ ID NO: 10. 
Therefore, Appellants were clearly in possession of "the polypeptide having the amino acid 
sequence of amino acids 34-321 of SEQ ID NO: 10" at the time of filing, and the related 
amendments do not constitute new matter. 

With respect to the amendment "the polypeptide having the amino acid sequence of 
amino acids 81-109 or 232-253 of SEQ ID NO: 10," Figure 10 discloses the following fragments 
of SEQ ID NO:10: amino acids 57-80, 110-126, 215-231, and 254-274. Implicit in the 
disclosure of fragments 57-80 and 1 10-126 of SEQ ID NO: 10 is the disclosure of the fragment 
81-109 of SEQ ID NO:10. Likewise, the disclosure of fragments 110-126 and 215-231 of SEQ 
ID NO: 10 implicitly discloses the fragment 232-253 of SEQ ID NO: 10. Therefore, in light of 
Figure 10, Appellants were clearly in possession of the subject matter "the polypeptide having 
the amino acid sequence of amino acids 81-109 or 232-253 of SEQ ID NO: 10" at the time of 
filing, and the amendments do not constitute new matter. 

B. Utility Rejection - Detailed Arguments 

The first issue before the Board is whether Appellants have asserted at least one "specific, 
substantial, and credible utility." See Examination Guidelines, 66 Fed. Reg. 1092 (2001). 
Appellants have asserted that the claimed polypeptides related to SEQ ID NO: 10 (the PR0874 
polypeptide) are useful as diagnostic tools for cancer, particularly for lung cancer. This asserted 
utility is specific, substantial, and credible, as is explained in more detail below. 
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1. Utility - Lesal Standard 

A "specific utility" is defined as utility which is "specific to the subject matter claimed," 
in contrast to "a general utility that would be applicable to the broad class of the invention." See 
M.P.E.P, § 2107.01 I. For example, it is generally not enough to state that a nucleic acid is 
useful as a diagnostic tool without also identifying the condition that is to be diagnosed. 

The requirement of "substantial utility" defines a "real world" use, and derives from the 
Supreme Court's holding in Brenner v. Manson, 383 U.S. 519, 534 (1966) stating that "[t]he 
basic quid pro quo contemplated by the Constitution and the Congress for granting a patent 
monopoly is the benefit derived by the public from an invention with substantial utility." In 
explaining the "substantial utility" standard, M.P.E.P. § 2107.01 cautions, however, that Office 
personnel must be carefijl not to interpret the phrase "immediate benefit to the public" or similar 
formulations used in certain court decisions to mean that products or services based on the 
claimed invention must be "currently available" to the public in order to satisfy the utility 
requirement, "Rather, any reasonable use that an applicant has identified for the invention that 
can be viewed as providing a public benefit should be accepted as sufficient at least with regard 
to defining a 'substantial' utility." M.P.E.P. § 2107.01 (emphasis added). 

Indeed, the Guidelines for Examination of Applications for Compliance With the Utility 
Requirement, set forth in M.P.E.P, § 2107 11(B)(1) gives the following instruction to patent 
examiners: "If the applicant has asserted that the claimed invention is useful for any particular 
practical purpose . . . and the assertion would be considered credible by a person of ordinary skill 
in the art, do not impose a rejection based on lack of utility." 

Finally, in assessing the credibility of the asserted utility, the M.P.E.P. states that "to 
overcome the presumption of truth that an assertion of utility by the applicant enjoys" the PTO 
must establish that it is "more likely than not that one of ordinary skill in the art would doubt 
(i.e., 'question') the truth of the statement of utility." M.P.E.P. § 2107.02 III A. 

2. Utility - Burden of Proof 

It is well established that a specification which contains a disclosure of utility which 
corresponds in scope to the subject matter sought to be patented "must be taken as sufficient to 
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satisfy the utility requirement of § 101 for the entire claimed subject matter unless there is reason 
for one skilled in the art to question the objective truth of the statement of utility or its scope." In 
re Langer, 503 F.2d 1380, 1391, 183 U.S.P.Q. 288, 297 (C.C.P.A. 1974). Thus "the PTO has the 
initial burden of challenging a presumptively correct assertion of utility in the disclosure." In re 
Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995). Only after the PTO provides 
evidence showing that one of ordinary skill in the art would reasonably doubt the asserted utility 
does the burden shift to the applicant to provide rebuttal evidence sufficient to convince such a 
person of the invention's asserted utility. Id 

3. Utility - Standard of Proof 

Compliance with 35 U.S.C. § 101 is a question of fact. Raytheon v. Roper, 724 F.2d 951, 

956, 220 U.S.P.Q. 592, 596 (Fed. Cir. 1983). The evidentiary standard to be used throughout ex 

parte examination in setting forth a rejection is a preponderance of the evidence, or "more likely 

than not" standard. In re Oetiker, 977 F.2d 1443, 1445, 24 U.S.P.Q.2d 1443, 1444 (Fed. Cir. 

1992). This is stated explicitly in the M.P.E.P.: 

[T]he applicant does not have to provide evidence sufficient to establish that an 
asserted utility is true "beyond a reasonable doubt." Nor must the applicant 
provide evidence such that it establishes an asserted utility as a matter of 
statistical certainty. Instead, evidence will be sufficient if, considered as a whole, 
it leads a person of ordinary skill in the art to conclude that the asserted utility is 
more likely than not true . M,P.E.P, § 2107.02, part VII (emphasis in original, 
citations omitted). 

The Court of Appeals for the Federal Circuit has stated that the standard for satisfying the 

utility requirement is a low one: 

The threshold of utility is not high : An invention is "useful" under section 101 if 
it is capable of providing some identifiable benefit. See Brenner v. Manson, 383 
U.S. 519, 534, 86 S.Ct. 1033, 16 L.Ed.2d 69 (1966); Brooktree Corp v. Advanced 
Micro Devices, Inc, 977 F.2d 1555, 1571 (Fed. Cir. 1992) ("To violate § 101 the 
claimed device must be totally incapable of achieving a useful result"); Fuller v. 
Berger, 120 F. 274, 275 (7th Cir. 1903) (test for utility is whether invention "is 
incapable of serving miy beneficial end"). Juicy Whip, Inc. v. Orange Bang, Inc., 
185 F.3d 1364, 1366, 51 U.S.P.Q. 2d 1700 (Fed. Cir. 1999) (emphasis added). 
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The low threshold for satisfying the utility requirement is reflected in the standard set by the 

Federal Circuit for invalidating a patent based on a lack of utility: "[T]he fact that an invention 

has only limited utility and is only operable in certain applications is not grounds for finding lack 

of utility. Some degree of utility is sufficient for patentability. Further, the defense of non-utility 

cannot be sustained without proof of total incapacity ." Envirotech Corp, v. Al George, Inc., 730 

F.2d 753, 762, 221 U.S.P.Q. 473 (Fed. Cir. 1984) (emphasis added, citations omitted). 

Because the standard for satisfying the utility requirement is so low, requiring total 

incapacity for a finding of no utility, the M.P.E.P. cautions that: 

Rejections under 35 U.S.C. 101 have been rarely sustained by federal courts. 
Generally speaking, in these rare cases, the 35 U.S.C. 101 rejection was sustained 
[] because the applicant . . . asserted a utility that could only be true if it violated a 
scientific principle, such as the second law of thermodynamics, or a law of nature, 
or was wholly inconsistent with contemporary knowledge in the art. M.P.E.P. § 
2107.02 III B., citing In re Gazave, 379 F.2d 973, 978, 154 U.S.P.Q. 92, 96 
(C.C.P.A. 1967) (underline emphasis in original, italic emphasis added). 

4. Appellants Asserted a Specific, Substantial and Credible Utility that is 
Sufficient to Satisfy the Utility Requirement ofS 101 

The claimed subject matter is directed to polypeptides comprising the amino acid 
sequence of the polypeptides related to SEQ ID NO: 10, or the amino acid sequence of the 
polypeptide encoded by the recited portion of the full-length coding sequence of the cDNA 
deposited under ATCC accession number 209922. Additional claimed subject matter is directed 
to polypeptides having at least 95% amino acid sequence identify to the recited sequences, 
wherein the isolated polypeptide or a fragment thereof can be used to generate an antibody which 
can be used to specifically detect the polypeptide of SEQ ID NO: 10 in lung tissue samples. The 
polypeptide of SEQ ID NO: 10 (referred to as "PR0874 polypeptide") is encoded by the 
polynucleotide of SEQ ID N0:9 (also referred to as DNA40621-1440). Specification at Iffl 
[0035-0036]. Appellants have asserted that the claimed polypeptides are useful as diagnostic 
tools for cancer, particularly lung cancer. 

In "Example 18: Tumor Versus Normal Differential Tissue Expression Distribution" 
Appellants disclose that the mRNA encoding PR0874 polypeptide is more highly expressed in 
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normal lung tissue compared to lung tumor tissue. Specification at [0529-0530] and 
accompanying tables. As explained in paragraph [0530], the differential expression of the 
PR0874 mRNA was detected using the well-established technique of quantitative PGR 
amplification of cDNA libraries isolated from different human normal and tumor tissue samples. 
To ensure that equivalent amounts of nucleic acid were used in each reaction, the cDNA for p- 
actin was used as a control. 

The specification teaches that identification of the differential expression of a PRO 
polypeptide-encoding mRNA in one or more tumor tissues as compared to one or more normal 
tissues of the same tissue type "renders the molecule useful diagnostically for the determination 
of the presence or absence of tumor in a subject suspected of possessing a tumor." Specification 
at [0530]. Because it is well established that changes in mRNA levels lead to changes in the 
level of the encoded protein, based on the teachings in the specification, one would expect the 
PR0874 protein to be differentially expressed in lung tumors. The specification states that PRO 
polypeptides "may also be used diagnostically for tissue typing, wherein the PRO polypeptides of 
the present invention may be differentially expressed in one tissue as compared to another, 
preferably in a diseased tissue as compared to a normal tissue of the same tissue type." 
Specification at ^ [0336]. The specification also discloses that PRO polypeptides and 
polypeptides related thereto can be used to generate anti-PRO antibodies. Id at [0364] and 
[0367]. The specification teaches that such antibodies to PRO polypeptides can be useful as 
diagnostic tools: 

[A]nti-PRO antibodies may be used in diagnostic assays for PRO [polypeptide], 
e.g., detecting its expression (and in some cases, differential expression) in 
specific cells, tissues, or serum. Various diagnostic assay techniques known in the 
art may be used, such as competitive binding assays, direct or indirect sandwich 
assays and immunoprecipitation assays conducted in either heterogeneous or 
homogeneous phases. Specification at [0407]. 

Taken together, the specification clearly discloses the use of the claimed polypeptides as 
diagnostic tools for cancer, particularly lung cancer. This utility is substantial, as one of skill in 
the art will recognize that the diagnosis of cancer is a "real world" use; it is specific, as the 
diagnosis of lung cancer is not a utility that applies to the broad class of polypeptides; and it is 



-13- 



Appl.No. : 10/063,514 

Filed : May 1,2002 



credible, as it not a utility "that could only be true if it violated a scientific principle, ...or a law 
of nature, or [is] wholly inconsistent with contemporary knowledge in the art." M.P.E.P. § 
2107.02 III B., citing/n re Gazave, 379 F.2d 973, 978, 154 U.S.P.Q. 92, 96 (CC.P.A. 1967). 

Because Appellants' specification contains a disclosure of utility which corresponds in 
scope to the claimed subject matter, the asserted utility "must be taken as sufficient to satisfy the 
utility requirement of § 101 for the entire claimed subject matter unless there is reason for one 
skilled in the art to question the objective truth of the statement of utility or its scope." In re 
Longer, 503 F.2d 1380, 1391, 183 U.S.P.Q. 288, 297 (CC.P.A. 1974). Therefore, the burden of 
establishing a prima facie case of lack of utility rests with the PTO. See, In re Brana, 51 F.3d 
1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995) ("the PTO has the initial burden of challenging 
a presumptively correct assertion of utility in the disclosure"). 

5. The Examiner's Arsuments 

In the Office Action dated October 4, 2005, the Examiner rejected the pending claims, 
stating "Claims 4-7, 9, 1 1-17 are rejected under 35 U.S.C. 101 because the claimed invention is 
not supported by either a specific and substantial asserted utility or a well established utility." 
Office Action at 2. 

To establish a prima facie showing that the claimed subject matter lacks utility, the 
Examiner must "provide[] evidence showing that one of ordinary skill in the art would 
reasonably doubt the asserted utility." In re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 
(Fed. Cir. 1995). The Examiner has issued a first Office Action, a final Office Action, and an 
Office Action following Request for Continued Examination during the prosecution of the 
instant application. None of these papers provide any evidence that one of ordinary skill in the 
art would reasonably doubt the asserted utility. 

The Examiner has rejected Appellants' asserted utility, stating that "the countervailing 
evidence shows that the significance or relevance of the PR0874 mRNA expression to the 
asserted therapeutic and diagnostic utilities of the claimed PR0874 polypeptide is unknown, and 
that the skilled artisan would have a reasonable, legitimate basis to doubt the utility of the 
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PR0874 polypeptide." Office Action at 2-3. For the reasons discussed below, the Examiner's 
countervailing references are not sufficient to support his rejection of the pending claims. 

First, the Examiner cites Hu et ai (J. Proteome Res. 2003; 2(4):405-12) to support his 
statement challenging the significance and relevance of the data presented in Example 18: "the 
literature cautions researchers from drawing conclusions based on small changes in transcript 
expression levels between normal and cancerous tissue." Id at 6. The Examiner also quotes 
from an editorial by LaBaer (Nature Biotechnol. 2003; 21(9):976-7), an author of the Hu 
reference, as stating "reports of mRNA or protein changes of as little as two-fold are not 
uncommon, and although some changes of this magnitude turn out to be important, most are 
attributable to disease-independent differences between samples." Id, at 8. According to the 
Examiner, this evidence shows that "it cannot be determined if the difference, if any, of PR0874 
mRNA expression between normal lung and lung tumor is significant or insignificant, disease- 
dependent or -independent, relevant or irrelevant." Id, at 8. 

To support his assertion that "protein levels cannot be accurately predicted from the level 
of the corresponding mRNA transcript," the Examiner cites Haynes et al. (Electrophoresis 1998; 
19(1 1):1862-71) and Gygi et al (Mol. and Cell. Bio. 1999; 19(3): 1720-30). Office Action at 12- 
14. The Examiner also cites a number of references originally cited by Appellants to support the 
assertion that there are numerous levels of control of protein synthesis, a fact Appellants have 
acknowledged, but which is not contrary to their asserted utility. Id. at 16-18. 

Finally, the Examiner cites Yousef et al (Cancer Res. 2003; 63(9):2223-7) to support the 
assertion that "even in cases where genes are differentially expressed in a cancer and 
overexpression of the corresponding protein products in cancer is verified, the art indicates that 
further experimentation is necessary to establish the usefulness of these genes and gene products 
for diagnosis, prognosis, and treatment of cancer." Id. at 9. 

The remainder of the Examiner's arguments are unsupported conclusory statements, or 
simply restatements of the assertions discussed above, which the Examiner repeats throughout 
the Office Action. Therefore, they cannot provide any additional basis for the Examiner's 
rejection. 



-15- 



Appl. No. : 10/063,514 

Filed : May 1,2002 



6. The Examiner has not established a Prima Facie case that Claims 6-7, 9 and 
1M7 lack Utility 

The above arguments do not satisfy the Examiner's burden to "provide[] evidence 
showing that one of ordinary skill in the art would reasonably doubt the asserted utility," In re 
Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995). The Examiner has the burden 
of presenting "countervailing facts and reasoning sufficient to establish that a person of ordinary 
skill would not believe the Appellant's assertion of utility." M.P.E.P, at §2107.02 III.A., citing 
in re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995) ("Only after the PTO 
provides evidence showing that one of ordinary skill in the art would reasonably doubt the 
asserted utility does the burden shift to the Appellant to provide rebuttal evidence") (emphasis 
added). With the exception of the specific references cited by the Examiner, the Examiner's 
assertions are not supported by facts, evidence, or reasoning. Instead, the Examiner simply 
makes unsupported assertions. As for the references cited by the Examiner, for the reasons 
discussed below, they do not support the Examiner's position. Therefore there is simply no 
evidence in the record to support the Examiner's assertion that Appellants' asserted utility is not 
substantial, and the invention is incomplete. Absent some substantial evidence to support his 
assertions, the Examiner has failed to establish a prima facie showing that one of skill in the art 
would reasonably doubt the asserted utility, and the Board should accept Appellants' disclosed 
utility as sufficient. 

a. The data in Example 18 are sufficient to establish the asserted utility 
Appellants turn first to the Examiner's arguments challenging the reliability of the data 
reported in Example 18. The Examiner argues that Example 18 and the first declaration of 
Mr, Grimaldi are insufficient to overcome the utility rejection. In particular, the Examiner cites 
Hu et al. and LaBaer as countervailing evidence to support the assertion that "it cannot be 
determined if the differences, if any, of PR0874 mRNA expression between normal lung and 
lung tumor is significant or insignificant, disease-dependent or -independent, relevant or 
irrelevant." Office Action at 8. 
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As an initial matter, Appellants note that, other than the Examiner's assertions based on 
Hu et ai and LaBaer, the remainder of the objections to the data reported in Example 18 are not 
supported by any evidence or reasoning as to why the data are insufficient, and therefore they 
cannot establish a prima facie case. See In re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 
(Fed. Cir. 1995) ("Only after the PTO provides evidence showing that one of ordinary skill in the 
art would reasonably doubt the asserted utility does the burden shift to the Appellant to provide 
rebuttal evidence.") (emphasis added). Despite this deficiency, Appellants address the 
Examiner's objections below. 

The gene expression data in Example 18 of the specification show that the mRNA 
associated with protein PR0874 was more highly expressed in normal lung tissue compared to 
lung tumor tissue. See Specification at T| [0530] and accompanying tables. Gene expression was 
analyzed using standard quantitative PGR amplification reactions of cDNA libraries isolated 
from different human tumor and normal human tissue samples. Id, It is well known in the art 
that the number of copies of a particular cDNA in the cDNA library is determined by the number 
of copies of the corresponding mRNA in the sample. Therefore, the cDNA libraries can be used 
to determine the level of expression of the corresponding mRNA in the tissue. 

Appellants have asserted that identification of the differential expression of the PR0874 
polypeptide-encoding gene in tumor tissue compared to the corresponding normal tissue renders 
antibodies against the polypeptide useful as a diagnostic tool for the determination of the 
presence or absence of tumor. Id. In support of this asserted utility. Appellants submitted on 
December 7, 2004 as Exhibit 1 to their Amendment and Response to Office Action a first 
Declaration of J. Christopher Grimaldi, an expert in the field of cancer biology. This declaration 
explains the importance of the data in Example 18, and how differential gene and protein 
expression studies are used to differentiate between normal and tumor tissue. See First Grimaldi 
Declaration, 

In paragraphs 6 and 7, Mr. Grimaldi explains that the semi-quantitative analysis 
employed to generate the data of Example 1 8 is sufficient to determine if a gene is over- or 
under-expressed in tumor cells compared to corresponding normal tissue. He states that any 
visually detectable difference seen between two samples is indicative of at least a two-fold 
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difference in cDNA between the tumor tissue and the counterpart normal tissue. He also states 
that the results of the gene expression studies indicate that the genes of interest "can be used to 
differentiate tumor from normal." He explains that, contrary to the PTO's assertions, "[t]he 
precise levels of gene expression are irrelevant; what matters is that there is a relative difference 
in expression between normal tissue and tumor tissue." First Grimaldi Declaration at ^ 7. 

This declaration makes clear that since it is the relative level of expression between 
normal tissue and suspected cancerous tissue that is important, how high the level of expression 
in normal tissue is, is irrelevant. As to the Examiner's questions about the reliability of the 
results, Appellants employed standard techniques which are well-known and accepted by those of 
skill in the art. Mr. Grimaldi states that if a difference is detected using these techniques, "this 
indicates that the gene and its corresponding polypeptide and antibodies against the polypeptide 
are useful for diagnostic purposes..." Id. 

In addition, the Grimaldi Declaration provides further facts relating to Example 18 by 
stating that "DNA libraries used in the gene expression studies were made from pooled samples 
of normal and of tumor tissues." Id, at \ 5. Mr. Grimaldi then explains the significance of this 
fact by stating that "[d]ata from pooled samples is more likely to be accurate than data obtained 
from a sample from a single individual." Id. Appellants are not required to prove utility to a 
statistical certainty, only that it is more likely than not true. See Nelson v. Bowler, 626 F.2d 853, 
856-57, 206 U.S.P.Q, 881, 883-84 (C.C.P.A. 1980) (reversing the Board and rejecting an 
argument that evidence of utility was insufficient because it was not statistically significant). 
Therefore, whether the results are statistically certain or not is irrelevant to establishing the 
asserted utility. Thus, it is the uncontested opinion of an expert in the field that the results are 
reliable enough to indicate that the claimed polypeptides are useftil as diagnostic tools. 

Hu and LaBaer are cited by the Examiner as countervailing evidence to show that the 
significance of the disclosed analysis of PR0874 mRNA expression in relation to PR0874 
polypeptide expression in cancer diagnosis or treatment is unknown. Office Action at 6-8. The 
Examiner cites Hu for the proposition that "the literature cautions researchers from drawing 
conclusions based on small changes in transcript expression levels between normal and 
cancerous tissue." Id. at 6. The Examiner also quotes from the editorial by LaBaer, stating in 
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part that "reports of mRNA or protein changes of as little as two-fold are not uncommon,... most 
are attributable to disease-independent differences between samples." Id at 8. The Examiner, 
concludes that "[i]t cannot be determined if the difference, if any, of PR0874 mRNA expression 
between normal lung and lung tumor is significant or insignificant, disease-dependent or 
-independent, relevant or irrelevant." Id. 

In Hu, the researchers used an automated literature-mining tool to summarize and 
estimate the relative strengths of all human gene-disease relationships published on Medline. 
They then generated a microarray expression dataset comparing breast cancer and normal breast 
tissue. Using their data-mining tool, they looked for a correlation between the strength of the 
literature association between the gene and breast cancer, and the magnitude of the difference in 
expression level. They report that for genes displaying a 5-fold change or less in tumors 
compared to normal, there was no evidence of a correlation between altered gene expression and 
a known role in the disease. See Hu at 411. However, among genes with a 10-fold or more 
change in expression level, there was a strong correlation between expression level and a 
published role in the disease. Id. at 412. Importantly, Hu reports that the observed correlation 
was only found among estrogen receptor-positive tumors, not ER-negative tumors. Id, 

The general findings of Hu are not surprising - one would expect that genes with the 
greatest change in expression in a disease would be the first targets of research, and therefore 
have the strongest known relationship to the disease as measured by the number of publications 
reporting a connection with the disease. The correlation reported in Hu only indicates that the 
greater the change in expression level, the more likely it is that there is a published or known role 
for the gene in the disease, as found by their automated literature-mining software. Thus, Hu's 
results merely reflect a bias in the literature toward studying the most prominent targets, and 
reflect nothing regarding the ability of a gene that is 2-fold or more differentially expressed in 
tumors to serve as a disease marker. 

Hu acknowledges the shortcomings of this method in explaining the disparity in Hu's 
findings for ER-negative versus ER-positive tumors: Hu attributes the "bias in the literature" 
toward the more prevalent ER-positive tumors as the explanation for the lack of any correlation 
between number of publications and gene expression levels in less-prevalent (and, therefore, less 
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studied) ER-negative tumors. Id, Because of this intrinsic bias, Hu's methodology is unlikely to 
ever note a correlation of a disease with less differentially-expressed genes and their 
corresponding proteins, regardless of whether or not an actual relationship between the disease 
and less differentially-expressed genes exists. Accordingly, Hu's methodology yields results that 
provide little or no information regarding biological significance of genes with less than 5-fold 
expression change in disease. Nowhere in Hu does it say that a lack of correlation in their study 
means that genes with a less than five-fold change in level of expression in cancer cannot serve 
as a molecular marker of cancer. 

Appellants submit that a lack of known role for the PR0874 gene in cancer does not 
prevent it from being useful as diagnostic tools for cancer. There is a difference between use of a 
gene for distinguishing between tumor and normal tissue on the one hand, and establishing a role 
for the gene in cancer on the other. Genes with lower levels of change in expression may or may 
not be the most important genes in causing the disease, but the genes can still show a consistent 
and measurable change in expression. While such genes may or may not be good targets for 
further research, the encoded polypeptides and antibodies which bind to them can nonetheless be 
used as diagnostic tools. Thus, Hu does not refute the Appellants' assertion that the PR0874 
gene and the claimed polypeptides can be used as a cancer diagnostic tools because the gene and 
polypeptide are differentially expressed in certain tumors. 

Contrary to the Examiner's assertion that one must know what role a gene or polypeptide 
plays in cancer for it to have utility, the PTO's own written policies recognize that the utility of a 
nucleic acid does not depend on the function of the encoded gene product. The Utility 
Examination Guidelines published on January 5, 2001 state: "In addition, the utility of a claimed 
DNA does not necessarily depend on the function of the encoded gene product. A claimed DNA 
may have a specific and substantial utility because, e.g. it hybridizes near a disease-associated 
gene or it has a gene regulating activity." (Federal Register, Volume 66, page 1095, Comment 
14). Similarly, here the disclosed nucleic acids, as well as the encoded polypeptides and related 
antibodies, are useful for determining whether an individual has cancer regardless of whether or 
not they eire the cause of the cancer. 
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The position of the Examiner requiring a known role for PR0874 in cancer for utility is 
also inconsistent with the analogous standard for therapeutic utility of a compound where "the 
mere identification of a pharmacological activity of a compound that is relevant to an asserted 
pharmacological use provides an 'immediate benefit to the public' and thus satisfies the utility 
requirement." M.P.E.P. §2701.01 (emphasis original). Here, the mere identification of altered 
expression in tumors is relevant to diagnosis of tumors, and, therefore, provides an immediate 
benefit to the public. 

As mentioned above, the Examiner also quotes a statement from LaBaer that "reports of 
mRNA or protein changes of as little as two-fold are not uncommon, and although some changes 
of this magnitude turn out to be important, most are attributable to disease-independent 
differences between samples." Office Action at 8. Apparently based on Hu and LaBaer, the 
Examiner states that it cannot be determined if the difference in PR0874 mRNA expression is 
"significant or insignificant, disease-dependent or -independent, relevant or irrelevant." Id 

LaBaer is an unreviewed letter to the editor by an author of the Hu et al article describing 
the automated literature searching tool used in the Hu et al, reference discussed above. LaBaer 
provides no further evidence than that provided in Hu, and provides no basis to consider the 
results of Hu applicable to diagnostic utility of differentially expressed genes. Accordingly, 
LaBaer suffers from the same defects discussed above with respect to Hu et al In particular, the 
biological correlations referred to in LaBaer are correlations with a published or known role for 
the gene in the disease. As discussed above, with respect to Hu et al, the results retrieved by the 
automated literature searching tool of Hu as described in LaBaer reflect a bias in the literature 
toward studying the most prominent targets, and reflect nothing regarding the ability of a gene 
that is differentially expressed in tumors to serve as a disease marker. 

Importantly, Hu and LaBaer both discuss results from microarray studies, a technique 
which is not as accurate as the RT-PCR method used by Appellants. See //w at 41 1-12; LaBaer 
at 976-6. Therefore their statements regarding the reliability of changes in mRNA levels 
assessed using microarrays is not applicable to the data presented in Example 18, which were 
generated using the more accurate and reliable method of RT-PCR. In addition, both references 
are silent regarding the reliability of pooled samples of human tissue, and whether or not 
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differential expression in pooled samples are susceptible to disease-independent differences 
between samples. The Examiner's concern that it cannot be determined if the differences in 
PR0874 mRNA expression "are significant or insignificant, disease-dependent or -independent, 
relevant or irrelevant" is addressed by the fact that the data in Example 18 are RT-PCR data, and 
by the statement in the first Grimaldi Declaration that "DNA libraries used in the gene expression 
studies were made from pooled samples of normal and of tumor tissues. Data from pooled 
samples is more likely to be accurate than data obtained from a sample from a single individual." 
First Grimaldi Declaration at T| 5. 

The Examiner provides no basis for dismissing these statements by Grimaldi. Hu and 
LaBaer cannot support the Examiner's dismissal of the first Grimaldi Declaration because both 
are based on studies using microarrays, not PCR, and both are silent regarding the use of 
differentially expressed genes as diagnostic tools in general, and the reliability of pooled samples 
in particular. Hu and LaBaer say nothing about whether or not differential expression data 
generated by PCR analysis from pooled samples are susceptible to disease-independent 
differences between samples and the Examiner has not offered any arguments or evidence to 
counter Grimaldi's statements. Therefore, Hu and LaBaer do not provide a basis for doubting 
Appellants' differential expression data. As such, there is no evidence that one skilled in the art 
would question whether the differential expression of PR0874 mRNA in pooled samples was 
disease-dependent or disease-independent. 

The data in Example 18 and the first Grimaldi Declaration are therefore sufficient to 
establish the asserted utility, and the Examiner has not rebutted the presumption of utility that the 
Appellants' application is afforded. Mr. Grimaldi is an expert in the field who conducted or 
supervised the experiments at issue. His declaration is based on personal knowledge of the 
relevant facts at issue. Appellants' have reminded the Examiner that "Office personnel must 
accept an opinion from a qualified expert that is based upon relevant facts whose accuracy is not 
being questioned." M.P,E.P. § 2107 (emphasis added). In addition, declarations relating to 
issues of fact should not be summarily dismissed as "opinions" without an adequate explanation 
of how the declaration fails to rebut the Examiner's position. See in re Alton 76 F.3d 1 168 (Fed. 
Cir. 1996). 
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The Examiner has offered no reason or evidence to reject either the underlying data or the 
conclusions of the Grimaldi Declaration, other than the cited Hu and LaBaer references which are 
not contrary to Appellants' assertion. It is important to note that nowhere in Hu or LaBaer does 
it say that a lack of correlation in their study means that genes with a less than five-fold change in 
level of expression in cancer cannot serve as a molecular marker of cancer. Likewise, Hu and 
LaBaer are silent regarding the reliability of pooled samples. As Hu and LaBaer are the only 
evidence cited by the Examiner to support his assertion that the data in Example 1 8 are unreliable 
and insufficient to support Appellants' asserted utility, the Examiner is left with no basis to 
question the conclusion of the first Grimaldi Declaration or Appellants' assertions. Therefore, 
the Examiner must accept Mr. Grimaldi 's opinion with regard to his statement that the genes of 
interest "can be used to differentiate tumor from normal." 

Although it is only briefly mentioned in the most recent Office Action, the Examiner 
previously relied on Wang et al (Trends in Pharmacol. Sci. 1996; 17(8):276-9) to support his 
argument that Appellants have not sufficiently validated the role of PR0874 gene in disease 
processes. Office Action at 8. In Wang, the authors outline a strategy for using mRNA 
differential display for the discovery of novel pharmacological targets. They state that the use of 
mRNA differential display for the isolation of novel genes associated with disease processes will 
no doubt facilitate discovery of novel pharmaceutical targets. See Wang at 279. However, they 
state that it is the first of many steps in the process, and that characterization of the functions of 
the gene as well as validation of the importance of the gene in disease processes is required. Id 

Nothing in Wang is contrary to Appellants' assertion that differentially expressed genes 
and polypeptides can be used as molecular markers of cancer. Wang speaks only of the 
additional steps needed to develop a novel pharmacological target . Contrary to the Examiner's 
assertions, nowhere does Wang teach that differentially expressed genes and polypeptides cannot 
be used as molecular markers for cancer. Thus, Wang does not refute the Appellants' assertion 
that the PR0874 gene, polypeptide, and antibodies can be used as cancer diagnostic tools 
because they are differentially expressed in lung tumors. 

In conclusion, Appellants submit that the evidence reported in Example 18, which is 
supported by the first Grimaldi Declaration, establish that there is at least a two-fold difference in 
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PR0874 mRNA between lung tumor tissue and normal lung tissue, and that the data in Example 
18 are significant and reliable enough to support the assertion of utility for PR0874 as a 
diagnostic tool for lung cancer. The Examiner has not offered any significant arguments or 
evidence to the contrary, and therefore has not established a prima facie case that one of skill in 
the art would reasonably doubt the asserted utility of the PR0874 mRNA, or, as explained 
below, the PR0874 polypeptide. 

b. The references cited by the Examiner do not refute Appellants* assertion 
that a change in mRNA levels leads to a corresponding chanse in the 
level of the encoded protein 

Appellants turn next to the Examiner's second argument based on Haynes and Gygi that 
protein expression levels are not predictable from the mRNA expression levels. Id. at 12-14. 
The Examiner also relies on Allman et al, arguing that it provides an example of art where 
mRNA expression and protein expression do not correlate. Id. at 15, For the reasons discussed 
below, none of these references are contrary to Appellants' assertion that generally speaking, 
changes in mRNA levels lead to corresponding changes in the level of polypeptide. 

Haynes studied whether there is a correlation between the level of mRNA expression and 
the level of protein expression for 80 selected genes from yeast. The genes were selected 
because they constituted a relatively homogeneous group with respect to predicted half-life and 
expression level of the protein products. See Haynes at 1 863. Haynes did not examine whether a 
change in transcript level for a particular gene led to a change in the level of expression of the 
corresponding protein. Instead, Haynes determined whether the steady-state transcript level 
correlated with the steady-state level of the corresponding protein based on an analysis 
comparing levels of mRNA and protein across 80 different genes. The Examiner also relies on 
Gygi et al, a study on which the Haynes reference is based. Like Haynes, the Gygi reference 
looked at static levels of mRNA across different genes, not changes in mRNA levels for a single 
gene. 

According to the Examiner, Haynes and Gygi teach "that protein levels cannot be 
accurately predicted from the level of the corresponding mRNA transcript and that there are 
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numerous levels of control of protein synthesis, degradation, processing and modification, which 
are only apparent by direct protein analysis." Office Action at 14. Based on these teachings, the 
Examiner concludes that "the skilled artisan would not know if or how PR0874 polypeptide 
expression would change in cancer." Id, 

Appellants emphasize that neither Haynes nor Gygi looked at whether a change in 
transcript level for a particular gene led to a change in the level of expression of the 
corresponding protein - neither reference examined differential expression of mRNA and 
proteins. Instead, these references considered whether the steady-state transcript level correlated 
with the steady-state level of the corresponding protein by comparing mRNA and protein levels 
across many different genes. These references disclose that similar mRNA levels for different 
genes did not universally result in equivalent protein levels for the different gene products, and 
that similar protein levels for different gene products did not universally result from equivalent 
mRNA levels for the different genes. These results are expected, since there are many factors 
that determine translation efficiency for a given transcript, or the half-life of the encoded protein. 
Not surprisingly, based on these results, Haynes and Gygi concluded that protein levels cannot 
always be accurately predicted from the level of the corresponding mRNA transcript when 
looking at the steady-state level of transcripts across different genes . 

Importantly, Haynes and Gygi did not say that for a single gene, changes in the level of 
mRNA transcript are not positively correlated with changes in the level of protein expression. 
Appellants have asserted that increasing or decreasing the level of mRNA for the same gene 
leads to a increase or decrease for the encoded protein - that is to say that differential mRNA 
expression leads to differential protein expression. Haynes and Gygi did not study this issue and 
say absolutely nothing about it. One cannot look at the level of mRNA across several different 
genes to investigate whether a change in the level of mRNA for a particular gene leads to a 
change in the level of protein for that gene. 

Haynes and Gygi have nothing to do with changes in mRNA levels resulting in changes 
in protein levels because they only examined static levels of mRNA and protein compared across 
different genes. Thus, when these references assert that protein levels cannot be accurately 
predicted from mRNA levels, they are referring to the finding that similar steady-state levels of 
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mRNA for different genes can lead to the different levels of protein for the corresponding gene 
products, and that the similar steady-state level of protein for different genes can result from 
different levels of the corresponding mRNAs. This is completely unrelated to the assertion that a 
change in mRNA levels leads to a corresponding change in the level of the encoded protein . 

Neither Haynes nor Gygi provide any insight whatsoever into the effects on protein level 
caused by a change in the encoding mRNA level. The Examiner has not provided any evidence 
or reasons w^hich support extending the findings of Haynes and Gygi regarding steady-state levels 
of mRNA for different genes to reject Appellants' assertions regarding changes in mRNA levels 
for the same gene. While Appellants acknowledge that protein levels are regulated at numerous 
levels, including post-translational mechanisms, Haynes and Gygi only provide insight on how 
these mechanisms lead to differences in protein levels across different genes. They do not 
address the Appellants' assertion that generally, changes in mRNA level for a particular gene 
lead to changes in the level of the encoded protein. Since this issue is not addressed in Haynes 
and Gygi, these references offer no support for the Examiner's rejection of Appellants' asserted 
utility. 

Finally, while the Examiner acknowledges that "there are examples in the art where 
mRNA expression and protein expression correlate," the Examiner relies on a single reference, 
Allman et al (Blood, 87(12):5257-68 (1996)), to support his statement that "[hjowever, there are 
examples where they do not correlate." Id. at 15. The Examiner focuses on the finding reported 
by Allman et al that germinal center B cells express dramatically more BCL-6 protein than 
resting B cells, despite similar BCL-6 mRNA levels in the two cell populations. 

This finding is not contrary to Appellants assertion that changes in mRNA level result in 
corresponding changes in protein level. Appellants have not asserted that every change in protein 
level is the result of a change in mRNA level, and therefore an observed change in protein level 
without a corresponding change in mRNA level is not contrary to Appellants assertion. 
Therefore, the increase in protein level observed in Allman in the absence of an increase in 
mRNA levels does not provide any evidence as to whether Appellants' assertion that a change in 
mRNA level will lead to a corresponding change in the encoded protein. 
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In addition, the Allman reference supports Appellants' assertion that increased mRNA 
levels lead to increased protein levels. In the discussion of their findings they state that "an 
unanticipated finding was that the higher BCL-6 protein levels... could not be fully accounted for 
by increased mRNA expression." Allman at 5267, column 1 , carryover paragraph (emphasis 
added). This statement indicates that one of skill in the art accepts that increased mRNA levels 
lead to increased protein levels, and in addition, that this is the expected or most likely reason for 
observed changes in protein levels. 

Even if the results in Allman supported the Examiner's argument, which they do not as 
discussed above, one contrary example does not establish that one of skill in the art would 
believe that changes in mRNA level do not generally result in changes in protein level, especially 
in light of the more likely than not standard. As shown by the declarations, references, and 
textbooks discussed below, Appellants submit the working hypothesis among those skilled in the 
art is that changes in mRNA level generally lead to a corresponding change in protein level. 

c. No further research is required to establish the usefulness of the claimed 
polypeptides under the utility standards of patent law 

Finally, the Examiner asserts that even in cases where genes are differentially expressed 

in cancer, the art teaches that further experimentation is necessary to establish the usefulness of 

these genes and gene products for diagnosis, prognosis and treatment of cancer, citing for support 

Yousef (Cancer. Res. 2003; 63(9):2223-7). Office Action at 9. In particular, the Examiner points 

to the portion of Yousef that states: 

In conclusion, here we provide strong evidence suggesting that seven KLKs are 
up-regulated in ovarian cancer. These data were confirmed by examining several 
databases, in addition to protein analysis of KLKs in tissue cytosols. Further 
experimentation is necessary to establish the usefulness of these KLKs for 
diagnosis, prognosis, and treatment of ovarian cancer. Yousef at 2226, right 
column. 

Based on this statement, the Examiner asserts: 

Therefore, Yousef, which discloses more about a potential cancer diagnostic, 
prognostic, or treatment than the present specification discloses about PR0874, is 
consistent with and supports the examiner's position that the present specification 
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fails to disclose enough information about the invention to make its usefulness 
immediately apparent to those familiar with the technological field of the 
invention, and that the present disclosure is simply a starting point for further 
research and investigation into potential practical uses of the claimed invention. 
Hence, in the present case the characterization of PR0874 mRNA as being "more 
highly expressed in" normal lung as compared to lung tumor does not provide an 
immediate benefit to the public for the claimed PR0874 polypeptide. Office 
Action at 10. 

Thus, the Examiner, instead of using a standard for evaluating the utility of claims 
according to the Utility Guidelines and the guidance provided by the courts, has elected a 
st2indard for evaluating utility of claimed subject matter supported by differential mRNA 
expression data as provided by the publication of Yousef. Appellants submit that a description of 
"usefulness" in a scientific publication does not substitute for a determination of the utility of 
claimed subject matter under 35 U.S.C. §101. The fact that terms such as "usefulness" can take 
on different meanings in different contexts has been addressed by the courts, which have 
emphasized that utility is to be evaluated according to the requirements under the law for 
obtaining a patent, and not under some different standard. For example, when the PTO 
previously held that preclinical tests were insufficient to establish utility, the Federal Circuit 
admonished: 

The Commissioner, as did the Board, confiises the requirements under the law for 
obtaining a patent with the requirements for obtaining government approval to 
market a particular drug for human consumption. See Scott v. Finney, 34 F.3d 
1058, 1063, 32 U.S.P.Q.2D (BNA) 1 1 15, 1 120 (Fed. Cir. 1994) ("Testing for the 
fiill safety and effectiveness of a prosthetic device is more properly left to the 
Food and Drug Administration (FDA). Title 35 does not demand that such human 
testing occur within the confines of Patent and Trademark Office (PTO) 
proceedings."). In re Brana, 51 F.3d 1560, 1567, 34 U.S.P.Q.2d 1436 (Fed. Cir. 
1995). 

The Federal Circuit then explained 

FDA approval, however, is not a prerequisite for finding a compound useful 
within the meaning of the patent laws. Usefulness in patent law, and in particular 
in the context of pharmaceutical inventions, necessarily includes the expectation 
of further research and development. The stage at which an invention in this field 
becomes usefiil is well before it is ready to be administered to humans. Were we 
to require Phase II testing in order to prove utility, the associated costs would 



-28- 



Appl. No. : 10/063,514 

Filed : May 1, 2002 



prevent many companies from obtaining patent protection on promising new 
inventions, thereby eliminating an incentive to pursue, through research and 
development, potential cures in many crucial areas such as the treatment of 
cancer. Id. at 1568. 

According to the Utility Guidelines and the courts, the utility standard in patent law 
derives from the Supreme Court's holding in Brenner v. Manson, 383 U.S. 519, 534 (1966) 
stating that "[t]he basic quid pro quo contemplated by the Constitution and the Congress for 
granting a patent monopoly is the benefit derived by the public from an invention with 
substantial utility." In explaining the "substantial utility" standard, M.P.E.P. § 2107.01 cautions, 
however, that Office personnel must be careful not to interpret the phrase "immediate benefit to 
the public" or similar formulations used in certain court decisions to mean that products or 
services based on the claimed invention must be "currently available" to the public in order to 
satisfy the utility requirement. "Rather, any reasonable use that an applicant has identified for the 
invention that can be viewed as providing a public benefit should be accepted as sufficient at 
least with regard to defining a 'substantial' utility." M.P.E.P, § 2107.01, (emphasis added). 

There is no basis for concluding that Yousef s standard for "usefulness" is in accord with 
Brenner v. Manson or In re Brana or the PTO's Utility Guidelines. In fact, there is no basis 
whatsoever for determining how Yousef concludes what is useful and what is not useful. 
Further, the publication by Yousef itself is inconsistent in describing what is "useful." The 
above-quoted text of Yousef refers to seven KLKs, termed hK5, hK6, hK7, hK8, hKlO, hKl 1 
and hKl 4. Yousef teaches that "hK6, hKlO, and hKl 1 are useful serological diagnostic markers 
for ovarian cancer." Yousef at 2226, left column. However, as the Examiner has noted, Yousef 
elsewhere states that for hK5, hK6, hK7, hK8, hKlO, hKl 1 and hK14, "further experimentation 
is necessary to establish the usefulness of these KLKs for diagnosis, prognosis, and treatment of 
ovarian cancer." Yousef at 2226, right column. Yousef provides no guidance for how hK6, 
hKlO, and hKll are useful serological diagnostic markers while still requiring "further 
experimentation ... to establish the usefulness of these KLKs for diagnosis ... of ovarian cancer." 
The inconsistency of Yousef itself underscores the fact that Yousef is not an authority for 
establishing "the requirements under the law for obtaining a patent." Determining utility by 
picking and choosing one of the varying standards provided by Yousef for finding something 
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useful cannot substitute for a determination of whether or not claimed subject matter has utility 
under the patent laws. Instead, utility of the claimed polypeptides must be determined according 
to the requirements of 35 U.S.C. §101 as elaborated by the courts, and not by an inconsistent, 
ambiguous standard provided in a scientific publication. 

Appellants submit that, in contrast with the Examiner's reliance on Yousef, an assertion 
of diagnostic utility based on differential expression is consistent with the Revised Interim Utility 
Guidelines promulgated by the PTO, which recognize that proteins which are differentially 
expressed in cancer have diagnostic utility. The caveat in Example 12 of the Revised Interim 
Utility Guidelines Training Materials states that the utility requirement is satisfied where a 
protein is expressed on melanoma cells but not on normal skin, and that antibodies against the 
protein can be used to diagnose cancer. Therefore, the Revised Interim Utility Guidelines make 
clear that differential expression that is diagnostic of cancer provides an inmiediate benefit to the 
public. 

The Examiner has stated that the caveat in Example 12 is not applicable to the present 
application: 

[BJecause unlike the situation wherein the specification discloses that receptor A 
is present on the cell membranes of melanoma cells but not on the cell membranes 
of normal skin cells, in the present case Applicants rely on a qualitative 
comparison of PR0874 mRNA expression between tumor tissue and normal 
samples in order to establish utility for the presently claimed PR0874 
polypeptide. However, the present specification does not teach the level of 
reproducibility or the level of reliability of the results seen in Example 18. The 
skilled artisan would not know if these differences were disease-dependent or 
disease-independent. Furthermore, Applicants have not provided any testing of 
the expression, role, or activity of the PR0874 polypeptide. Office Action at 19. 

This standard is not consistent with the caveat in Example 12, which only states that "the 
specification also discloses that receptor A is present on the cell membranes of melanoma cells 
but not on the cell membranes of normal skin cells." Revised Interim Utility Guidelines Training 
Materials at 69. There is no statement in the caveat of Example 12 that the specification, in 
addition to disclosing differential expression of receptor A, also discloses "the level of 
reproducibility or the level of reliability of the results" as required by the Examiner in the present 
application. There is no indication in Example 12 that statistical support for the observed 
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differential expression must be established by providing, for example, the level of reproducibility 

or the level of reliability of the results. In fact, it is inappropriate to require statistical certainty of 

the disclosed evidence in order to establish utility, as provided in the M.P.E.P.: 

[T]he applicant does not have to provide evidence sufficient to establish that an 
asserted utility is true "beyond a reasonable doubt." Nor must the applicant 
provide evidence such that it establishes an asserted utility as a matter of 
statistical certainty. Instead, evidence will be sufficient if, considered as a whole, 
it leads a person of ordinary skill in the art to conclude that the asserted utility is 
more likely than not true . M.P.KP. at § 2107.02, part VII (2004) (emphasis in 
original). 

In the instant case, the Examiner asserts that Appellants' disclosure, because it does not 
provide "the level of reproducibility or the level of reliability of the results," is insufficient to 
establish utility for the presently claimed polypeptides. This is inconsistent with the Utility 
Guidelines and the M.P.E.P., and is contrary to the first Grimaldi Declaration which states that 
the data are sufficiently reliable such that if a difference is detected, "this indicates that the gene 
and its corresponding polypeptide and antibodies against the polypeptide are useful for diagnostic 
purposes..." First Grimaldi Declaration at ^ 7. Accordingly, insofar as the utility rejection of 
the claims is based on the reasoning that the specification must disclose the level of 
reproducibility or the level of reliability of the results, the basis for the utility rejection is 
unfounded. 

d. Conclusion - Examiner has failed to establish a prima facie case that 
one of skill in the art would doubt Appellants' asserted utility 

The Examiner has questioned the sufficiency, reliability and significance of the data 
reported in Example 18 as well as the supporting first Grimaldi Declaration, relying on Hu et al 
and LaBaer for support. The Examiner relies on Wang to support his position that Appellants 
have not sufficiently validated the PR0874 polypeptide. To support his assertion that 
polypeptide levels cannot be accurately predicted from mRNA levels, the Examiner relies on 
Haynes et al., Gygi et al, and Allman. Finally, the Examiner relies on Yousef et al to support 
his assertion that further research is required to establish utility for the claimed invention. 
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Appellants submit that these arguments and supporting references fail to establish a prima facie 
case that one skilled in the art would reasonably doubt Appellants' asserted utility. 

First, Appellants have shown that the data in Example 18 are sufficiently significant and 
reliable to establish that PR0874 is useful as a cancer diagnostic tool. This assertion is 
supported by the first Grimaldi Declaration. The Examiner has not provided any substantial 
reason or evidence for one of skill in the art to doubt the reliability or useflilness of Example 1 8, 
or the facts and conclusions in the first Grimaldi Declaration. In particular, Appellants maintain 
Hu e( al. and LaBaer do not contradict Appellants' position because Hu et al and LaBaer focus 
on the role of polypeptides in cancer, not whether differentially expressed mRNAs can be used as 
diagnostic markers. In addition, the statements of Hu and LaBaer relate to microarray data, not 
PGR data as disclosed in the instant invention, and neither reference addresses the reliability of 
pooled samples. Therefore they cannot be relied on to counter the expert opinions expressed in 
the first Grimaldi Declaration that the differential expression of PR0874 makes it useful as a 
diagnostic tool. Similarly, Appellants maintain that Wang relates to the development and 
validation of novel pharmacological targets, not diagnostic tools. Therefore, like Hu and LaBaer, 
Wang is simply not relevant, and does not support the rejection of Appellants' asserted utility. 

The Examiner also relies on Haynes et al, Gygi et al, and Allman. Appellants have 
shown that the Haynes and Gygi references are simply not relevant to the issue of whether a 
change in mRNA levels leads to a corresponding change in the level of the encoded protein since 
the references examined steady-state levels of mRNA and protein across different genes. Allman 
is also not contrary to Appellants' assertion that a change in mRNA level leads to a 
corresponding change in protein level because Allman reports that mRNA levels were unchanged 
while protein levels increased. Appellants have not asserted that changing mRNA levels is the 
only way to increase protein levels, and therefore the results in Allman are not contrary to 
Appellants' asserted utility. In addition, the statement in Allman that it was unanticipated that 
the change in protein level could not be accounted for by a change in mRNA supports 
Appellants' assertion that changes in mRNA level lead to changes in protein level. 

Finally, the Examiner relies on Yousef for the proposition that further research is required 
to establish the usefulness of the claimed invention. Appellants submit that the standard for 
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"usefulness" under 35 U.S.C. § 101 is established by the PTO and the courts. Therefore, any 
conclusory statement regarding "usefulness" by Yousef is irrelevant to whether the claimed 
invention meets the requirements of utility under the patent laws. In addition, Yousef s standard 
for "usefulness" is inconsistent, as elsewhere in the article the author states that the molecules in 
question are useful as diagnostic tools. 

Taken together, the Examiner's arguments are not sufficient to satisfy the Examiner's 
burden to "provide [] evidence showing that one of ordinary skill in the art would reasonably 
doubt the asserted utility." In re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q,2d 1436 (Fed. Cir. 
1995). The Examiner's arguments are largely conclusory statements which are not supported by 
any substantial evidence or reasoning which explains why one of ordinary skill in the art would 
reasonably doubt the asserted utility. Therefore, the Board should accept the Appellants' 
disclosure of utility. See Ex parte Rubin, 5 U.S.P.Q, 2d 1461 (Bd. Pat. App. & Interf. 1987) 
("There is no factual support in this record for the examiner's questioning of the denaturation test 
reported in the specification. ... No reason to doubt 'the objective truth' of the asserted utility 
having been advanced by the examiner, we accept appellant's disclosure of utility corresponding 
in scope to the claimed subject matter."). 

7. Appellants have provided Sufficient Rebuttal Evidence of Utility 

"Only after the PTO provides evidence showing that one of ordinary skill in the art would 

reasonably doubt the asserted utility does the burden shift to the applicant to provide rebuttal 

evidence." In re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995). The rebuttal 

evidence must be sufficient such that when it is considered as a whole, it is more likely than not 

that the asserted utility is true. See In re Oetiker, 977 F.2d 1443, 1445, 24 U.S.P.Q.2d 1443, 

1444 (Fed. Cir. 1992) (stating that the evidentiary standard to be used throughout ex parte 

examination in setting forth a rejection is a preponderance of the evidence, or "more likely than 

nof standard). The M.P.E.P. summarizes the standard of proof required: 

[T]he applicant does not have to provide evidence sufficient to establish that an 
asserted utility is true "beyond a reasonable doubt." Nor must the applicant 
provide evidence such that it establishes an asserted utility as a matter of 
statistical certainty. Instead, evidence will be sufficient if, considered as a whole, 
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it leads a person of ordinary skill in the art to conclude that the asserted utility is 
more likely than not true . M.P,KP, § 2107.02, part VII (emphasis in original, 
citations omitted). 

Appellants remind the Board that the Federal Circuit has stated that the standard for satisfying 
the utility requirement is a low one: "The threshold of utility is not high: An invention is 'useful' 
under section 101 if it is capable of providing some identifiable benefit." Juicy Whip, Inc. v. 
Orange Bang, Inc., 185 F.3d 1364, 1366, 51 U.S.P.Q. 2d 1700 (Fed. Cir. 1999). 

Even if the Examiner has satisfied his burden of presenting a prima facie case of lack of 
utility, Appellants have supplied more than enough rebuttal evidence, such that when considered 
as a whole, one of skill in the art would conclude that the asserted utility is more likely than not 
true. As discussed in detail below, Appellants have provided sufficient evidence that the gene 
encoding the PR0874 polypeptide is differentially expressed in lung tumors and can therefore be 
used as a diagnostic tool. In addition, Appellants have shown that it is well established in the art 
that there is a reasonable correlation between changes in mRNA level and changes in the 
corresponding protein level such that one of skill in the art would believe that the PR0874 
polypeptide is also differentially expressed in lung cancers. Therefore, considering the evidence 
as a whole, one of skill in the art would believe that it is more likely than not that the claimed 
polypeptides are useful as diagnostic tools for cancer, particularly lung cancer. 

a. Appellants have established that the sene encodin2 the PROS 74 
polypeptide is differentially expressed in certain cancers 

As discussed above, the Examiner has not provided any relevant evidence or reasoning to 
challenge the reliability and significance of the data in Example 18 which reports that the mRNA 
for PR0874 is more highly expressed in normal lung tissue compared to lung tumor tissue. In 
contrast to this complete lack of relevant evidence on the part of the Examiner, Appellants have 
submitted the first Grimaldi Declaration. That declaration establishes that it is the opinion of an 
expert in the field who has personal knowledge of the facts surrounding Example 1 8 that there is 
at least a two-fold difference in mRNA for PR0874 between the tumor tissue and the counterpart 
normal tissue, and that the PR0874 genes, polypeptides and antibodies are useful for 
differentiating tumor tissue from normal tissue. As discussed above, Hu e( al., LaBaer, and 
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Wang et al are not relevant to determining whether differentially expressed genes can be used as 
diagnostic tools for cancer. Therefore, the Examiner has not provided any relevant evidence or 
reasoning to challenge the facts and conclusions of the first Grimaldi Declaration in support of 
Example 18. 

Given the disclosure of Example 18 and the supporting first Grimaldi Declaration on the 
one hand, and the complete lack of any relevant evidence on the other, it is clear that considering 
the evidence as a whole, one of skill in the art would conclude that it is more likely than not that 
the PR0874 gene is differentially expressed in lung tumor tissue compared to normal lung tissue 
such that it is useful as a diagnostic tool to distinguish tumor tissue from normal tissue. 

As Appellants explain below, it is more likely than not that the PR0874 polypeptide is 
also differentially expressed in lung tumors, and can therefore be used to distinguish tumor tissue 
from normal tissue. 

b. Appellants have established that changes in mRNA expression levels 
generally lead to correspondins changes in the expression level of the 
encoded protein 

Appellants next tum to the second portion of their argument in support of their asserted 
utility - that it is well-established in the art that in most cases a change in the level of mRNA for 
a particular protein leads to a corresponding change in the level of the encoded protein. Given 
Appellants' evidence of decreased expression of the mRNA for the PR0874 polypeptide in lung 
tumor tissue compared to normal lung tissue, it is more likely than not that the PR0874 
polypeptide is likewise differentially expressed, and therefore the claimed polypeptides are useful 
as diagnostic tools, particularly for lung tumors. 

In support of the assertion that changes in mRNA are positively correlated to changes in 
protein levels, Appellants submitted a second Declaration by J. Christopher Grimaldi, an expert 
in the field of cancer biology (originally submitted on December 7, 2004 as Exhibit 2 with the 
Appellants' Amendment and Response to Office Action). As stated in paragraph 5 of the 
declaration, "Those who work in this field are well aware that in the vast majority of cases, when 
a gene is over-expressed... the gene product or polypeptide will also be over-expressed.... This 
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same principal applies to gene under-expression." Second Grimaldi Declaration at ^ 5. Further, 
"increased mRNA expression is expected to result in increased polypeptide expression, and the 
detection of decreased mRNA expression is expected to result in decreased polypeptide 
expression." Id, 

Appellants also submitted the declaration of Paul Polakis, Ph.D. an expert in the field of 

cancer biology (attached as Exhibit 3 to Appellants' Amendment and Response to Office Action 

submitted on December 7, 2004). As stated in paragraph 6 of his declaration: 

Based on my own experience accumulated in more than 20 years of research, 
including the data discussed in paragraphs 4 and 5 above [showing a positive 
correlation between mRNA levels and encoded protein levels in the vast majority 
of cases studied in relation to the present invention] and my knowledge of the 
relevant scientific literature, it is my considered scientific opinion that for human 
genes, an increased level of mRNA in a tumor cell relative to a normal cell 
typically correlates to a similar increase in abundance of the encoded protein in 
the tumor cell relative to the normal cell. In fact, it remains a central dogma in 
molecular biology that increased mRNA levels are predictive of corresponding 
increased levels of the encoded protein . Polakis Declaration at TI 6 (emphasis 
added). 

Dr. Polakis acknowledges that there are published cases where such a correlation does not exist, 
but states that it is his opinion, based on over 20 years of scienfific research, that "such reports 
are exceptions to the commonly understood general rule that increased mRNA levels are 
predictive of corresponding increased levels of the encoded protein." Polakis Declaration at ^ 6. 

The statements of Grimaldi and Polakis are supported by the teachings in Molecular 
Biology of the Cell, a leading textbook in the field (Bruce Alberts, et ai. Molecular Biology of 
the Cell (3'"* ed. 1994) (submitted on June 29, 2005, with Appellants' Submission filed with 
Request for Continued Examination as Exhibit 1, hereinafter "Cell 3'"*") and (4^^ ed. 2002) 
(submitted on June 29, 2005, with Appellants' Submission filed with Request for Continued 
Examination as Exhibit 2 hereinafter "Cell 4'^")). Figure 9-2 of Cell 3'^ shows the steps at which 
eukaryotic gene expression can be controlled. The first step depicted is transcriptional control. 
Cell 3^^ provides that "[fjor most genes transcriptional controls are paramount. This makes sense 
because, of all the possible control points illustrated in Figure 9-2, only transcriptional control 
ensures that no superfluous intermediates are synthesized." Cell at 403 (emphasis added). In 
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addition, the text states that "Ahhough controls on the initiation of gene transcription are the 
predominant form of regulation for most genes , other controls can act later in the pathway from 
RNA to protein to modulate the amount of gene product that is made." Cell i'*^ at 453 (emphasis 
added). Thus, as established in Cell 3^^*, the predominant mechanism for regulating the amount 
of protein produced is by regulating transcription. 

In Cell 4'*^, Figure 6-3 on page 302 illustrates the basic principle that there is a correlation 
between increased gene expression and increased protein expression. The accompanying text 
states that "a cell can change (or regulate) the expression of each of its genes according to the 
needs of the moment - mosi obviously by controlling (he production of its mRNA^ Cell 4'^ at 
302 (emphasis added). Similarly, Figure 6-90 on page 364 of Cell 4^^ illustrates the path from 
gene to protein. The accompanying text states that while potentially each step can be regulated 
by the cell, '' the initiation of transcription is the most common point for a cell to regulate the 
expression of each of its genes ." Cell 4 at 364 (emphasis added). This point is repeated on 
page 379, where the authors state that of all the possible points for regulating protein expression, 
" fflor most genes transcriptional controls are paramount ." Cell 4^^ at 379 (emphasis added). 

Further support for Appellants' position can be found in the textbook. Genes VI, 
(Benjamin Lewin, Genes VI (1997)) (submitted on June 29, 2005, with Appellants' Submission 
filed with Request for Continued Examination as Exhibit 3) which states "having acknowledged 
that control of gene expression can occur at multiple stages, and that production of RNA cannot 
inevitably be equated with production of protein, it is clear that the overwhelming majority of 
regulatory events occur at the initiation of transcription ." Genes VI at 847-848 (emphasis added). 

Additional support is also found in Zhigang et al. World Journal of Surgical Oncology 
2:13, 2004 (submitted on June 29, 2005, with Appellants' Submission filed with Request for 
Continued Examination as Exhibit 4). Zhigang studied the expression of prostate stem cell 
antigen (PSCA) protein and mRNA to validate it as a potential molecular target for diagnosis and 
treatment of human prostate cancer. The data showed "a high degree of correlation between 
PSCA protein and mRNA expression" Zhigang at 4. Of the samples tested, 81 out of 87 showed 
a high degree of correlation between mRNA expression and protein expression. The authors 
conclude that "it is demonstrated that PSCA protein and mRNA overexpressed in human prostate 
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cancer, and that the increased protein level of PSCA was resulted from the upregulated 

transcription of its mRNA." Id. at 6. Even though the correlation between mRNA expression 

and protein expression occurred in 93% of the samples tested, not 100%, the authors state that 

"PSCA may be a promising molecular marker for the clinical prognosis of human Pea and a 

valuable target for diagnosis and therapy of this tumor." Id. at 7. 

Further, Meric et al. Molecular Cancer Therapeutics, vol. 1, 971-979 (2002), (submitted 

on June 29, 2005, with Appellants' Submission filed with Request for Continued Examination as 

Exhibit 5), states the following: 

The fundamental principle of molecular therapeutics in cancer is to exploit the 
differences in gene expression between cancer cells and normal cells... [M]ost 
efforts have concentrated on identifying differences in gene expression at the level 
of mRNA, which can be attributable to either DNA amplification or to differences 
in transcription. Meric et al at 971 (emphasis added). 

Exploiting differences in gene expression between cancer cells and normal cells would not be a 
"fundamental principle" of molecular cancer therapeutics if there were no significant correlation 
between gene expression and protein levels. Stated another way, changes in mRNA without 
corresponding changes in protein levels would have little or no effect on cellular biology, and 
those of skill in the art would have no reason to examine the differences in gene expression at the 
mRNA level without such a correlation. However, as one of skill in the art recognizes, there is a 
strong correlation between changes in mRNA and changes in protein level. It is because of this 
strong correlation that it remains a "fundamental principle" of molecular therapeutics in cancer to 
look at changes in mRNA level. 

Finally, Appellants note that Yousef et al, cited by the Examiner in the Office Action 
dated October 4, 2005, also supports Appellants' assertion. Yousef reports that they found seven 
kallikrein (KLK) genes were up-regulated in ovarian cancer compared to normal ovarian tissue. 
Yousef at Abstract. Yousef further investigated this overexpression of KLK mRNA by 
examining the corresponding protein expression of six of the seven overexpressed KLK genes. 
As the Examiner has acknowledged, Yousef reports that they "experimentally verified the 
overexpression of six KLK proteins in cancer versus normal or benign tissues." Id. 
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Together, the expert declarations of Grimaldi and Polakis, the accompanying references, 
and the excerpts and references discussed above all establish that the accepted understanding in 
the art is that there is a reasonable correlation between changes in gene expression and changes in 
the level of the encoded protein. In contrast to this substantial amount of evidence supporting 
Appellants' position, the Examiner has cited two references, Haynes et al and Gygi et al. which 
examined the same data, as well as Allman ei al. However, as discussed above, Haynes et al, 
Gygi et al are not relevant to the issue of whether a change in mRNA levels leads to a change in 
the level of the corresponding protein since both Haynes and Gygi looked only at steady-state 
mRNA and protein levels across different genes, and Allman reported a change in protein level 
in the absence of a change in mRNA level, not a change in mRNA level with no resulting change 
in protein level. Finally, as discussed above, the standard of "usefulness" discussed in Yousef is 
not the standard for utility under the patent laws, and the data in Yousef actually support 
Appellants' utility by reporting a correlation between changes in mRNA levels and the 
corresponding protein level. It is clear that when considered as a whole, the preponderance of the 
evidence clearly weighs in favor of Appellants. 

Appellants have presented sufficient evidence to establish that the mRNA for PR0874 is 
differentially expressed in lung tumor tissue compared to its normal tissue counterpart, and that it 
is more likely than not that this leads to differential expression of the PR0874 polypeptide. This 
makes the claimed polypeptides useful for diagnosing cancer, particularly lung cancer. Given the 
overwhelming amount of evidence in support of Appellants' position, and the absence of any 
evidence in support of the Examiner's position, when considered as a whole the evidence leads a 
person of ordinary skill in the art to conclude that the asserted utility is more likely than not true. 

c. The asserted utility is specific 

Finally, Appellants address the Examiner's assertion that the asserted utilities are not 
specific to the claimed polypeptides. 

Specific utility is defined as utility which is "specific to the subject matter claimed," in 
contrast to "a general utility that would be applicable to the broad class of the invention." 
M.P.E.P. § 2107.01 I. Appellants submit that the evidence of differential expression of the 
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PR0874 gene and polypeptide in certain types of tumor cells, along with the declarations and 
references discussed above, provide a specific utility for the claimed polypeptides. 

As discussed above, there are significant and reliable data which show that the gene for 
the PR0874 polypeptide is expressed at least two-fold higher in normal lung tissue compared to 
lung tumor tissue. These data are strong evidence that the PR0874 gene and polypeptide are 
associated with lung tumors. Thus, contrary to the assertions of the Examiner, Appellants have 
provided evidence associating the PR0874 gene and polypeptide with a specific disease. The 
asserted utility for the claimed polypeptides as diagnostic tools for cancer, particularly lung 
cancer, is a specific utility - it is not a general utility that would apply to the broad class of 
polypeptides. 

8. The Examiner's Response to Appellants ' Evidence is Insufficient to Rebut 
Appellants 'Arguments 

The Examiner has stated that the Grimaldi and Polakis declarations and other references 
cited by Appellants are insufficient to overcome the rejection of claims based on 35 U.S.C. 
§§ 101. 

a. The Examiner ^s response to the First Grimaldi Declaration 
Appellants have relied on the first Grimaldi Declaration to further explain that the semi- 
quantitative analysis employed to generate the data of Example 18 is sufficient to determine if a 
gene is over- or under-expressed in tumor cells compared to corresponding normal tissue. The 
declaration states that any visually detectable difference seen between two samples is indicative 
of at least a two-fold difference in cDNA between the tumor tissue and the counterpart normal 
tissue, and that the results of the gene expression studies indicate that the genes of interest "can 
be used to differentiate tumor from normal." First Grimaldi Declaration at ^ 7. 
The Examiner has rejected the first Grimaldi Declaration, arguing that: 

All polynucleotides/polypeptides from a particular tumor can invariably be 
classified as either more highly expressed, less expressed, or unchanged as 
compared to some standard level of expression. It can then be asserted that any 
protein/polynucleotide that is expressed in this manner can be used to detect or 
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characterize the tumor. Such utilities are analogous to the assertion that a 
particular protein can be employed as a molecular weight marker, which is neither 
a specific or substantial utility. Final Office Action at 4-5 (emphasis added). 

This response fails to address the opinions expressed in the first Grimaldi Declaration. 
Neither the Appellants nor the first Grimaldi Declaration are asserting that "non-existent, or 
unchanged" expression provides utility for a gene or encoded polypeptide. This is made clear by 
the first Grimaldi declaration which states, "Example 1 8 reports the results of PGR analysis 
conducted as part of the investigating of several newly discovered DNA sequences.... The 
analyses were designed to determine whether a difference exists between gene expression in 
normal tissues as compared to tumor in the same tissue type. First Grimaldi Declaration at ^ 4 
(emphasis added). The declaration continues: "In differential gene expression studies, one looks 
for genes whose expression levels differ significantly under different conditions, for example, in 
normal versus diseased tissue." Id at TI 6 (emphasis added). Similarly, the declaration states that 
"the assay was designed to indicate whether a difference exists between normal tissue and tumor 
tissue of the same type. If a difference is detected , this indicates that the gene and its 
corresponding polypeptide and antibodies against the polypeptide are useful for diagnostic 
purposes, to screen samples to differentiate between normal and tumor." Id. at ^ 7. 

Appellants are asserting that because there is a difference in expression of the PROS 74 
mRNA in normal lung tissue compared to lung tumor tissue, the PR0874 nucleic acids have 
utility as diagnostic tools. In addition, as explained above, given the relationship between 
changes in mRNA level and changes in the level of the corresponding protein, one skilled in the 
art would expect PR0874 polypeptide expression to be differentially expressed in lung tumor 
tissue compared to the corresponding normal tissue as well. Therefore the claimed polypeptides 
have utility as diagnostic tools. 

This is not the same as an asserted utility of using a protein as a molecular weight marker 
since not all genes or proteins are differentially expressed in tumors. In addition, as detailed 
above, this is a specific utility since the PR0874 gene and protein are differentially expressed in 
a specific type of tumor, namely lung tumors. Thus, the Examiner's rejection of the first 
Grimaldi Declaration is based on a mischaracterization of Appellants' asserted basis for utility. 
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In rejecting the first Grimaldi Declaration, the Examiner also repeats his often made 
assertions that the data in Example 18 are unreliable, insufficient, lack statistical significance, 
and that "Applicants have failed to establish any correlation between PR0874 mRNA expression 
and PR0874 polypeptide expression" and therefore "the skilled artisan would not know if the 
difference in PR0874 mRNA is significant or insignificant, relevant or irrelevant, disease- 
dependent or disease-independent." Office Action at 5, 8. 

Appellants remind that PTO that the evidence of the asserted utility can be indirect 
evidence which is reasonably correlated to the asserted utility, and that the asserted utility need 
not be established to an absolute or statistical certainty or even beyond a reasonable doubt - the 
standard is more likely than not true. See M.P.E.P. § 2107.02, part VII; see also discussion infra 
at § VILB.9; Fujikawa v. Wattanasin, 93 F.3d 1559, 1565, 39 U.S.P.Q.2d 1895 (Fed. Cir. 1996) 
("a 'rigorous correlation' need not be shown in order to establish practical utility; 'reasonable 
correlation' suffices"). As discussed above, the uncontested first Grimaldi Declaration 
establishes that the data in Example 18 are reproducible, reliable, and significant enough that 
they can be used to distinguish tumor fi^om normal tissue. Appellants have also established that 
it is more likely than not that one of skill in the art would accept that changes in mRNA lead to 
corresponding changes in the encoded polypeptide. In light of all the evidence, one skilled in the 
art would find it more likely than not that the PR0874 polypeptide is differentially expressed in 
lung tumors based on the fact that the PR0874 mRNA is differentially expressed in lung tumors. 

Thus, contrary to the Examiner's assertions, Appellants have provided reliable and 
significant data regarding PR0874 mRNA expression; have established a correlation between 
PR0874 mRNA and protein expression because changes in protein level are predictable from 
changes in mRNA levels; and established that one skilled in the art would more likely than not 
believe that PR0874 polypeptide expression is differentially expressed in lung cancer. 

Finally, Appellants remind the PTO that "[ojffice personnel must accept an opinion from 
a qualified expert that is based upon relevant facts whose accuracy is not being questioned." 
M,P,KP, § 2107 (emphasis added). In addition, declarations relating to issues of fact should not 
be summarily dismissed as "opinions" without an adequate explanation of how the declaration 
fails to rebut the Examiner's position. See In re Alton 76 F.3d 1168 (Fed. Cir. 1996). The 
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Examiner has not supplied any reasons or evidence to question the accuracy of the facts upon 
which Mr. Grimaldi based his opinion. Mr. Grimaldi has personal knowledge of the relevant 
facts surrounding the performance and analysis of the data reported in Example 1 8, has based his 
opinion on those facts, and the Examiner has offered no reason or evidence to reject either the 
underlying facts or his opinion. 

In conclusion, Appellants submit that the evidence reported in Example 18, combined 
with the first Grimaldi Declaration, establish that there is at least a two-fold difference in 
PR0874 cDNA between lung tumor tissue and their respective normal tissue counterparts. 
Therefore, it follows that expression levels of the PR0874 gene and polypeptide can be used to 
distinguish lung tumor tissue from normal lung tissue. The Examiner has not offered any 
significant arguments or evidence to the contrary. 

b. The Examiner response to the Second Grimaldi Declaration 
To support their asserted utility, Appellants rely on paragraph 5 of the second Grimaldi 
Declaration: "Those who work in this field are well aware that in the vast majority of cases, when 
a gene is over-expressed... the gene product or polypeptide will also be over-expressed.... This 
same principal applies to gene under-expression." Second Grimaldi Declaration at ^ 5. Further, 
Appellants have cited the statement that "increased mRNA expression is expected to result in 
increased polypeptide expression, and the detection of decreased mRNA expression is expected 
to result in decreased polypeptide expression." Id. 

The Examiner asserts that the second Grimaldi Declaration is insufficient to overcome the 
rejection based upon a lack of utility. The Examiner repeats the same arguments made in 
rejecting the first Grimaldi Declaration: that the expression of all polynucleotides or polypeptides 
can invariably be classified as either increased, decreased, non-existent, or unchanged as 
compared to some standard level of expression; and that "no information is provided in the 
differential tissue expression of PR0874 mRNA regarding the level of expression, activity, or 
role in cancer of the PR0874 polypeptide." Final Office Action at 6. In addition, the Examiner 
relies on Allman et al. as evidence contrary to the secong Grimaldi Declaration. Office Action at 
15. 
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First, Appellants repeat that it is the difference in expression of the PR0874 mRNA in 
lung tumors compared to normal lung tissue which provides utility. Appellants maintain for the 
reasons articulated above that the data in Example 1 8 are significant and reliable, and that based 
on the evidence as a whole, one of skill in the art would accept that it is more likely than not that 
the PR0874 polypeptide is also differentially expressed in lung tumors. Therefore, Appellants 
have provided information regarding the expression level of the PR0874 polypeptide in cancer. 
As to any lack of evidence regarding the role or activity of the PR0874 polypeptide in cancer. 
Appellants reiterate that one does not need to know what role or activity PR0874 plays in cancer 
in order to use its differential expression to distinguish tumor tissue from normal tissue. 

Second, for the reasons discussed above, Allman is not contrary to the opinions expressed 
in the second Grimaldi Declaration because Allman did not observe a change in mRNA, and 
therefore does not say anything about whether changes in mRNA levels lead to corresponding 
changes in protein levels. 

The Examiner also rejects the second Grimaldi Declaration by arguing that the current 
situation is unlike the references cited in paragraph four of the declaration. The Examiner argues 
that unlike the situation in Grimaldi et al (Blood (1989) 73(8):2081-5) there is no evidence that 
the present situation involves the cloning of a chromosomal breakpoint, and unlike Meeker et al 
(Blood (1990) 76(2):285-9) and Singleton et al (Pathol. Annu. (1992) 27(l):165-90) where 
protein expression was associated with disease state, "the present specification does not provide 
any testing of the level of expression, activity, or role in cancer of the PR0874 polypeptide." 
Final Office Action at 7-8. 

As an initial matter, Appellants note that they have not relied on paragraph four of the 
second Grimaldi Declaration to support their asserted utility. Following a discussion of the 
Grimaldi et al, Meeker et al and Singleton et al references, the declaration states that "[i]f the 
chromosomal aberration results in the aberrant expression of mRNA and the corresponding gene 
product... then the gene product is a promising target for cancer therapy, for example, by the 
therapeutic antibody approach." Second Grimaldi Declaration at T| 4. Thus, paragraph four and 
the cited references relate to a discussion of the development of targets for cancer therapy, not 
cancer diagnostics. 
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In addition, the fact that there is no known translocation or mutation of PR0874 is 
irrelevant to whether its differential expression can be used to assist in diagnosis of cancer - one 
does not need to know why PR0874 is differentially expressed, or what the consequence of the 
differential expression is, in order to exploit the differential expression to distinguish tumor from 
normal tissue. As to the Examiner's argument that unlike Meeker et al and Singleton et al the 
present specification does not provide any testing of the level of expression, activity, or role in 
cancer of the PR0874 polypeptide, Appellants have provided information regarding changes in 
the expression of PR0874 mRNA in cancer, and one of skill in the art would recognize that this 
is reasonably correlated to changes in PR0874 protein expression such that Appellants have 
provided information regarding the expression of PR0874 polypeptide in cancer. 

The Revised Interim Utility Guidelines Training Materials available from the PTO's 
website establish that proteins which are differentially expressed in cancer have utility regardless 
of the lack of any known role in cancer. In Example 12 of the Guidelines, the hypothetical 
specification discloses a protein isolated from a cell membrane preparation which is the binding 
partner for protein X. Based on this relationship, the protein is designated receptor A. The 
function of X is not known, and the specification does not characterize the isolated protein with 
regard to its biological function or any disease. The Guidelines state that this disclosure is not 
sufficient to establish utility. However, in a caveat, the facts are changed such that the 
specification also discloses that receptor A is expressed on melanoma cells but not on normal 
skin cells. The guidelines state that this additional disclosure is sufficient to establish utility, in 
spite of the fact that nothing is disclosed about the role of the protein in melanoma. 

Appellants remind the PTO that "[ojffice personnel must accept an opinion from a 
qualified expert that is based upon relevant facts whose accuracy is not being questioned." 
M.P.E.P, § 2107 (emphasis added). In addition, declarations relating to issues of fact should not 
be summarily dismissed as "opinions" without an adequate explanation of how the declaration 
fails to rebut the Examiner's position. See In re Alton 76 F.3d 1168 (Fed. Cir. 1996). The 
Examiner has not supplied any reasons or evidence to reject either the underlying facts or opinion 
expressed in the second Grimaldi Declaration. 
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In conclusion, Appellants submit that the second Grimaldi declaration supports 
Appellants' assertion that one of skill in the art would be more likely than not to believe that the 
expression levels of the PR0874 gene and polypeptide can be used to distinguish lung tumor 
tissue from normal lung tissue. The Examiner has not offered any relevant arguments or 
evidence to the contrary, and therefore has not provided a basis to reject the second Grimaldi 
Declaration. 

c. The Examiner's response to the Polakis Declaration 
Appellants have relied on the statement of paragraph six of the Polakis Declaration to 
support their asserted utility: "an increased level of mRNA in a tumor cell relative to a normal 
cell typically correlates to a similar increase in abundance of the encoded protein in the tumor 
cell relative to the normal cell. In fact, it remains a central dogma in molecular biology that 
increased mRNA levels are predictive of corresponding increased levels of the encoded protein." 
Polakis Declaration at ^ 6. 

The Examiner has rejected the Polakis Declaration as insufficient to overcome the utility 
rejection because "[t]he present application provides no information regarding the level of 
expression, activity, or role in cancer of the PR0874 polypeptide. Only mRNA expression data 
was presented." Final Office Action at 9. The Examiner also states that the skilled artisan would 
not know if PR0874 mRNA expression was disease-dependent or disease-independent, would 
not know if or how PR0874 polypeptide expression would change in tumors." Office Action at 
15. 

Appellants submit that these arguments do not address the opinions expressed in the 
Polakis Declaration. As the second Grimaldi Declaration, the Polakis Declaration, and the other 
supporting references cited by the Appellants make clear, information about changes in mRNA 
levels is sufficient for one of skill in the art to conclude that it is more likely than not that the 
protein levels are similarly changed. Therefore, Appellants have provided information regarding 
the level of expression of the PR0874 polypeptide in cancer. 

Second, as discussed above, the Examiner's countervailing evidence does not support his 
arguments - Hu, LaBaer and Wang do not relate to using differentially expressed genes as 
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molecular markers; Haynes and Gygi examined steady-state mRNA and protein levels across 
different genes, not the relationship between changes in mRNA level and changes in protein 
levels, and Allman offers no support for the Examiner's position and arguably even supports 
Appellants' assertion; and Yousef reports a good correlation between changes in mRNA and 
protein. Therefore, the Examiner has not cited any relevant countervailing evidence on which to 
rely in rejecting the conclusions of the Polakis Declaration. 

The Examiner also rejects the Polakis Declaration, stating that a "dogma" is an 
authoritative principle, belief, or statement of ideas or opinion, especially one considered to be 
absolutely true. Final Office Action at 9. The Examiner asserts that Allman provide evidence 
that Dr. Polakis' asserted dogma is not "absolutely true" and that the skilled artisan would have a 
legitimate basis to doubt the utility of the PR0874 polypeptide based solely on the disclosure in 
Example 18. Id 

First, Appellants have emphasized that they do not need to prove an exact or absolute 

correlation between changes in mRNA and changes in protein levels to establish their asserted 

utility - the correlation does not need to be "absolutely true": 

[T]he applicant does not have to provide evidence sufficient to establish that an 
asserted utility is true "beyond a reasonable doubt." Nor must the applicant 
provide evidence such that it establishes an asserted utility as a matter of 
statistical certainty. Instead, evidence will be sufficient if, considered as a whole, 
it leads a person of ordinary skill in the art to conclude that the asserted utility is 
more likely than not true . M,P,E.P, § 2107.02, part VII (emphasis in original, 
citations omitted). 

In response to similar arguments made by Appellants, the Examiner stated that the 
rejection of Appellants' asserted utility "is not based upon 'an absolute certainty,' 'absolute 
predictability,' or 'an invariable exact correlation.'" Final Office Action at 3. 

In spite of this apparent acknowledgement of the proper standard for establishing utility, 
the Examiner rejects the Polakis declaration, arguing that Allman allegedly prove that Dr. 
Polakis' asserted dogma is not "absolutely true." Thus, it appears that the Examiner is 
continuing to require that Appellants establish their asserted utility as a matter of absolute or 
statistical certainty. 
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This standard is inconsistent with the Utility Guidelines and governing case law. 
Therefore, even if the Polakis "dogma" is not "absolutely true," this is not a basis to reject the 
asserted utility. Additionally, as noted above, the references relied on by the Examiner (Haynes, 
Gygi and Allman) do not actually provide any evidence that is contrary to the conclusions of the 
Polakis Declaration, and the Yousef references actually supports it. 

Appellants remind the PTO that "[o]ffice personnel must accept an opinion from a 
qualified expert that is based upon relevant facts whose accuracy is not being questioned." 
M.P.E.P, § 2107 (emphasis added). In addition, declarations relating to issues of fact should not 
be summarily dismissed as "opinions" without an adequate explanation of how the declaration 
fails to rebut the Examiner's position. See In re Alton 76 F.3d 1168 (Fed. Cir. 1996). The 
Examiner has not supplied any reasons or evidence to reject either the underlying facts or opinion 
expressed in the Polakis Declaration. 

In conclusion. Appellants submit that the Polakis Declaration supports Appellants' 
assertion that one of skill in the art would be more likely than not to believe that the expression 
levels of the PR0874 gene and polypeptide can be used to distinguish lung tumor tissue from 
normal lung tissue. The Examiner has not offered any relevant arguments or evidence to the 
contrary, and therefore has not provided a basis to reject the Polakis Declaration. 

d. The Examiner's response to the Molecular Biolosv of the Cell and 
Lewin references submitted by Appellants 
Appellants submitted the Molecular Biology of the Cell and Lewin references in support 
of their assertion that, in general, changes in mRNA expression correlates with changes in 
expression of the encoded polypeptide. Although the Examiner acknowledges the teachings in 
the Molecular Biology of the Cell and Lewin, he notes that Molecular Biology of the Cells states 
that "other controls can act later in the pathway from DNA to protein to modulate the amount of 
gene product that is made" and Lewin states that "production of RNA cannot inevitably be 
equated with production of protein." Office Action at 16-17. According to the Examiner, both 
Molecular Biology of the Cell and Lewin support and are consistent with the Examiner's position 
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that that the skilled artisan would not know if or how PR0874 polypeptide expression would 
change in cancer. Id at 17. 

Appellants have acknowledged that changes in mRNA do not always result in changes in 
protein level. However, Appellants do not have to prove utility to an "absolute certainty" or 
"beyond a reasonable doubt." Instead, one of skill in the art must believe that the asserted utility 
is more likely than not true. The Examiner fails to address the teachings of Molecular Biology of 
the Cell which provides that "[f|or most genes transcriptional controls are paramount ," Cell 3''^ at 
403 and Cell 4^^ at 379 (emphasis added); that "the initiation of gene transcription are the 
predominant form of regulation for most genes," Cell 3'^ at 453; that "a cell can change (or 
regulate) the expression of each of its genes according to the needs of the moment - most 
obviously by controlling the production of its mRNA ," Cell at 302 (emphasis added). 
Similarly, Figure 6-90 on page 364 of Cell 4*^ illustrates the path from gene to protein. The 
Examiner has not addressed the teachings of Lewin either, which states "having acknowledged 
that control of gene expression can occur at multiple stages, and that production of RNA cannot 
inevitably be equated with production of protein, it is clear that the overwhelming majority of 
regulatory events occur at the initiation of transcription ." Genes VI at 847-848 (emphasis added). 

The Examiner does not provide a basis for failing to address evidence of the 
"overwhelming majority" and "paramount" controls of gene expression, and instead focuses on 
"other controls" that can act later. Appellants submit that a full consideration of the submitted 
evidence would lead to a conclusion that the "overwhelming majority" and "paramount" controls 
have a greater influence on gene expression than "other controls" that can act later. Accordingly, 
full consideration of this evidence would lead to a conclusion that mRNA levels are paramount 
in influencing polypeptide expression levels, and that one of skill in the art would conclude that 
it is more likely than not that changes in mRNA lead to changes in the corresponding protein. 
Therefore, the Cell and Lewin references support Appellants' asserted utility, and the Examiner 
has not provided any basis to reject Appellants' conclusions based on these references. 
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e. The Examiner's response to the Zliisans reference submitted by 
Appellants 

Appellants submitted the Zhigang reference in support of their position that, in general, 
differential mRNA expression correlates with differential expression of the encoded polypeptide. 
The Examiner acknowledges that Zhigang presents data showing a high degree of correlation 
between changes in PSCA mRNA and protein expression. However, the Examiner asserts that 
exceptions were noted. Office Action at 1 7. Thus, according to the Examiner, Zhigang supports 
and is consistent with the Examiner's position that "the skilled artisan would not know if or how 
PR0874 polypeptide expression would change in cancer." Id 

Zhigang reported that the correlation between mRNA expression and protein expression 
occurred in 93% of the samples tested. Appellants submit that there is no requirement to provide 
evidence sufficient to establish an asserted utility as a matter of statistical certainty. "Instead, 
evidence will be sufficient if, considered as a whole, it leads a person of ordinary skill in the art 
to conclude that the asserted utility is more likely than not true ." M,P.E.P. at § 2107.02, part VII 
(2004) (emphasis in original, internal citations omitted). 

In previous responses, Appellants have emphasized that they do not need to prove an 
exact or absolute correlation between changes in mRNA and changes in protein levels to 
establish their asserted utility. In response, the Examiner previously stated that the final rejection 
"is not based upon 'an absolute certainty,' 'absolute predictability,' or 'an invariable exact 
correlation.'" Final Office Action 2ii 3. 

In spite of this apparent acknowledgement of the proper standard for establishing utility, 
the Examiner holds that a showing of 93% correlation between mRNA and polypeptide levels is 
an indication of support for unpredictability of the relationship between changes in mRNA and 
polypeptide levels. Thus, it is clear from the Examiner's evaluation of the evidence of record 
that the Examiner is requiring that Appellants establish their asserted utility as a matter of 
statistical or absolute certainty. This standard is inconsistent with the Utility Guidelines and 
governing case law. Since the Examiner's rejection of the teachings of the Zhigang reference is 
based on this incorrect standard, the Examiner has not provided any reasonable basis to reject 
Appellants' conclusion that Zhigang supports Appellants' asserted utility. 



-50- 



AppL No. : 10/063,514 

Filed : May 1, 2002 



/ The Examiner response to the Meric reference submitted by 
Appellants 

Appellants submitted the Meric reference in support of their position that, in general, 
changes in mRNA expression correlate with changes in expression of the encoded polypeptide. 
Appellants rely on the statement in Meric that "[t]he fundamental principle of molecular 
therapeutics in cancer is to exploit the differences in gene expression between cancer cells and 
normal cells." Meric at 971 (emphasis added). 

While the Examiner acknowledges this statement, the Examiner asserts that the 
specification does not provide any testing of the level of expression, activity, or role in cancer of 
the PR0874 polypeptide. Office Action at 18. Therefore, according to the Examiner, the 
differences in PR0874 polypeptide expression between cancer cells and normal cells are 
unknown, and thus they are not exploitable. Id. The Examiner states that Meric also 
acknowledges that several alterations in translational control occur in cancer and that gene 
expression is quite complicated. Id, at 18. Thus, the Examiner asserts that Meric suggests that 
protein levels can be modulated independently of the level of mRNA, and therefore Meric 
supports and is consistent with his position that "the skilled artisan would not know if or how 
PR0874 polypeptide expression would change in cancer." Id, 

Meric does teach that mutations of genes as well as alternate splicing and alternate 
transcription start sites can lead to altered translation efficiency in certain cancer cells. Meric at 
973-974. As support, Meric cites three examples of point mutations, and four examples of 
alternate splicing. Id, at 974. However, the Examiner has not shown, and there is no evidence, 
that the PR0874 mRNA is either mutated, alternately spliced, or has an alternate transcription 
start site. Nor has the Examiner established that point mutations or alternate splice variants 
leading to changes in translation efficiency are common in cancer, or common in lung tumors in 
particular. These few examples are not sufficient to provide evidence that one skilled in the art 
would reasonably doubt Appellants' asserted utility, or reject the teaching of Appellants' 
supporting references and declarations. 
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As the supporting references and declarations Appellants have discussed above make 
clear, regulation of mRNA levels is the predominant control mechanism for the majority of 
genes. Meric supports this assertion, stating "[t]he fundamental principle of molecular 
therapeutics in cancer is to exploit the differences in gene expression between cancer cells and 
normal cells." Meric at 971 (emphasis added). The only reason differences in mRNA expression 
between cancer and normal tissue are of any interest in studying the mechanism of cancer 
formation and growth is because mRNA encodes protein. If there were no general correlation 
between differences in mRNA and differences in protein, there would be no reason to study 
changes in mRNA . Thus, Meric supports Appellants' assertion that those of skill in the art 
recognize that there is a general correlation between changes in mRNA levels and changes in the 
level of the corresponding protein. 

As the Examiner has not offered any significant reasoning or evidence to the contrary, the 
teaching of Meric and Appellants' conclusion that it supports their asserted utility should be 
accepted as true. 

g. Conclusion 

In conclusion, Appellants have offered sufficient evidence to establish that one of skill in 
the art would be more likely than not to believe that because the PR0874 mRNA is differentially 
expressed in lung tumors compared to normal lung tissue, the PR0874 polypeptide will also be 
differentially expressed in lung tumors. This differential expression of the PR0874 polypeptide 
makes it useful as a diagnostic tool for cancer. The declarations and references cited by 
Appellants clearly support their asserted utility, and the Examiner has offered no relevant 
arguments or evidence to the contrary. In short, none of the Examiner's responses to Appellants' 
supporting evidence are sufficient to rebut Appellants' asserted utility. 

9. The Courts have held that the Utility Requirement was Satisfied in Similar 
Cases 

The seminal decision interpreting the utility requirement of 35 U.S.C. § 101 is Brenner v. 
Manson, 383 U.S. 519, 148 U.S.P.Q. 689 (1966). At issue in Brenner was a claim to "a chemical 
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process which yields an already known product whose utility - other than as a possible object of 
scientific inquiry - ha[d] not yet been evidenced." Id. at 529, 148 U.S.P.Q. at 693. The Patent 
Office rejected the claimed process for lack of utility because the product produced by the 
claimed process had no known use. See id at 521-22, 148 U.S.P.Q. at 690. On appeal, the Court 
of Customs and Patent Appeals reversed, holding "where a claimed process produces a known 
product it is not necessary to show utility for the product." Id. at 522, 148 U.S.P.Q, at 691. 

In reviewing the lower court's decision, the Court made its oft quoted statement that 
"[t]he basic quid pro quo contemplated by the Constitution and the Congress for granting a patent 
monopoly is the benefit derived by the public from an invention with substantial utility. Unless 
and until a process is refined and developed to this point - where specific benefit exists in 
currently available form - there is insufficient justification for permitting an Appellant to engross 
what may prove to be a broad field." Id at 534-35, 148 U.S.P.Q. at 695. 

The first opinion of the C.C.P.A. applying Brenner was In re Kirk, 376 F.2d 936, 153 
U.S.P.Q. 48 (C.C.P.A. 1967). The invention claimed in Kirk was a set of steroid derivatives said 
to have valuable biological properties and to be of value "in the furtherance of steroidal research 
and in the application of steroidal materials to veterinary or medical practice." Id, at 938, 153 
U.S.P.Q, at 50. In affirming the claim rejection based on a lack of utility, the court held that the 
"nebulous expressions 'biological activity' or 'biological properties'" did not adequately convey 
how to use the claimed compounds." Id. at 941, 153 U.S.P.Q. at 52. The court also rejected 
Appellants' supporting affidavit, stating, "the sum and substance of the affidavit appears to be 
that one of ordinary skill in the art would know 'how to use' the compounds to find out in the 
first instance whether the compounds are - or are not - in fact usefiil or possess useful properties, 
and to ascertain what those properties are." Id. at 942, 153 U.S.P.Q. at 53. 

Since these early decisions, the courts have continued to clarify what is sufficient to 
satisfy the utility requirement. Three more recent decisions are of particular relevance to the 
instant application: Nelson v. Bowler, 626 F.2d 853, 206 U.S.P.Q. 881 (C.C.P.A. 1980), Cross v. 
lizuka, 753 F.2d 1040, 224 U.S.P.Q. 739 (Fed. Cir. 1985), and Fujikawa v. Wattanasin, 93 F.3d 
1559, 39 U.S.P.Q. 2d 1895 (Fed. Cir. 1996). 
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The earliest of these cases. Nelson v. Bowler, involved an interference between two 
applications related to derivatives of naturally occurring prostaglandins (PG). Nelson, 626 F.2d 
at 854-55, The issue was whether Nelson had shown at least one utility for the compounds at 
issue to establish an actual reduction to practice. M at 855. The Appellants relied on two tests 
to prove practical utility: an in vivo rat blood pressure (BP) test and an in vitro gerbil colon 
smooth muscle stimulation (GC-SMS) test. In the BP test, the blood pressure of anesthetized rats 
was recorded on a polygraph chart to determine whether an injected compound had any effect. 
Responses were categorized as either a depressor (lowering) effect or a pressor (elevating) effect. 
Id. In the GC-SMS test a section of colon was excised from a freshly-killed gerbil for suspension 
in a physiological solution, and a lever arm was connected to the colon in such a way that any 
contraction was recorded as a polygraph trace. Id. The Board held that Nelson had not shown 
adequate proof of practical utility, characterizing the tests as "rough screens, uncorrelated with 
actual utility," Id, at 856. 

On appeal the C.C.P.A. reversed, holding that the Board "erred in not recognizing that 

tests evidencing pharmacological activity may manifest a practical utility even though they may 

not establish a specific therapeutic use." M The Court stated that "practical utility" was 

characterized as a use of the claimed discovery in a manner which provides some immediate 

benefit to the public, establishing the following rule: 

Knowledge of the pharmacological activity of any compound is obviously 
beneficial to the public. It is inherently faster and easier to combat illnesses and 
alleviate symptoms when the medical profession is armed with an arsenal of 
chemicals having known pharmacological activities. Since it is crucial to provide 
researchers with an incentive to disclose pharmacological activities in as many 
compounds as possible, we conclude that adequate proof of any such activity 
constitutes a showing of practical utility. Id. (emphasis added). 

The Court rejected Bowler's argument that the BP and GC-SMS tests are inconclusive 
showings of pharmacological activity since confirmation by statistically significant means did not 
occur until after the critical date. The Court stated that "a rigorous correlation is not necessary 
where the test for pharmacological activity is reasonably indicative of the desired response." Id. 
(emphasis added). The Court concluded that a " reasonable correlation " between the observed 
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properties and the suggested use was sufficient to establish practical utility. Id, at 857 (emphasis 
added). 

The sufficiency of a "reasonable correlation" in establishing utility was affirmed by the 
Court of Appeals for the Federal Circuit in Cross v. lizuka, 753 F.2d 1040, 224 U.S.P.Q. 739 
(Fed. Cir. 1985). In Cross^ the subject of the interference before the Court was imidazole 
derivative compounds which inhibit the synthesis of thromboxane synthetase, an enzyme which 
leads to the formation of thromboxane A2. At the time the applications were filed, 
thromboxane A2 was postulated to be involved in platelet aggregation, which was associated with 
several deleterious conditions. Id, at 1042. 

The question before the Board and reviewed by the Court was whether lizuka was entitled 
to the benefit of his Japanese priority application. Id, The Japanese application disclosed that 
the imidazole derivatives showed strong inhibitory action for thromboxane synthetase from 
human or bovine platelet microsomes, an in vitro utility. Id, at 1043. Relying in part on Nelson, 
the Board held that tests evidencing pharmacological activity may manifest a practical utility 
even though they may not establish a specific therapeutic use, and concluded that the in vitro 
tests were sufficient to establish a practical utility. Id, 

On appeal, Cross argued that the basic in vitro tests conducted in cellular fractions did not 
establish a practical utility for the claimed compounds, and that more sophisticated in vitro or in 
vivo tests were necessary to establish a practical utility. Id. at 1050. The Court rejected this 
argument, noting that adequate proof of any pharmaceutical activity constitutes a showing of 
practical utility. Id. The Court accepted the argument that initial testing of compounds is widely 
done in vitro: 

[I\n vitro results... are generally predictive of in vivo test results, i.e., there is a 
reasonable correlation therebetween. Were this not so, the testing procedures of 
the pharmaceutical industry would not be as they are. lizuka has not urged, and 
rightly so, that there is an invariable exact correlation between in vitro test results 
and in vivo test results. Rather, lizuka's position is that successfiil in vitro testing 
for a particular pharmacological activity establishes a significant probability that 
in vivo testing for this particular pharmacological activity will be successfiil. Id, 
(emphasis added). 
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The Court also noted that in previous decisions, its predecessor court had accepted 

evidence of in vivo utility as sufficient to establish practical utility. The Court reasoned that: 

This in vivo testing is but an intermediate link in a screening chain which may 
eventually lead to the use of the drug as a therapeutic agent in humans. We 
perceive no insurmountable difficulty, under appropriate circumstances, in finding 
that the first link in the screening chain, in vitro testing, may establish a practical 
utility for the compound in question. Successful in vitro testing will marshal 
resources and direct the expenditure of effort to fiirther in vivo testing of the most 
potent compounds, thereby providing an immediate benefit to the public, 
analogous to the benefit provided by the showing of an in vivo utility . Id at 1051, 
citing Nelson, 626 F.2d at 856 (emphasis added). 

Based on this reasoning, the Court affirmed the decision of the Board, stating that "based 
upon the relevant evidence as a whole, there is a reasonable correlation between the disclosed in 
vitro utility and an in vivo activity, and therefore a rigorous correlation is not necessary where the 
disclosure of pharmacological activity is reasonable based upon the probative evidence." Id at 
1050 (emphasis added). The Court therefore held that the disclosed in vitro utility was 
"sufficient to comply with the practical utility requirement of § 101." Id. at 1051. 

The holdings of Nelson and Cross were more recently affirmed in Fujikawa v. 

Wattanasin, 93 F.3d 1559, 39 U.S.P.Q.2d 1895 (Fed. Cir. 1996). In Fujikawa, the Court again 

affirmed the notion that initial screens of compounds provide a practical utility even though they 

may not provide a therapeutic use because '"[i]t is inherently faster and easier to combat illnesses 

and alleviate symptoms when the medical profession is armed with an arsenal of chemicals 

having known pharmacological activities.'" Id, at 1564, quoting Nelson, 626 F.2d at 856. The 

Court noted that it may be difficult to predict whether novel compounds will exhibit 

pharmacological activity, and consequently testing is often required to establish practical utility. 

Id. However the Court went on to state: 

But the test results need not absolutely prove that the compound is 
pharmacologically active. All that is required is that the tests be ''reasonably 
indicative of the desired [pharmacological] response." In other words, there must 
be a sufficient correlation between the tests and an asserted pharmacological 
activity so as to convince those skilled in the art, to a reasonable probability , that 
the novel compound will exhibit the asserted pharmacological behavior." Id, 
(internal citations omitted, underline emphasis added, italics in original). 
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On appeal, Fujikawa argued that Wattanasin had failed to establish an adequate 
correlation between the in vitro and in vivo results to permit Wattanasin to rely on positive in 
vitro results to establish a practical utility. The Court stated that the Board relied on testimony 
from those skilled in the art that the in vitro results convinced the experts that the claimed 
compounds would exhibit the desired pharmacological activity when administered in vivo, 
including testimony that in vivo activity is typically highly correlatable to a compound's in vitro 
activity in the field. Id at 1565. To overcome this evidence and counter the Board's decision, 
Fujikawa pointed to the testimony of its expert that "there is a reasonable element of doubt that 
some elements may be encountered which are active in the in vitro assay, but yet inactive in the 
in vivo assay." Id 

The Court rejected this argument: "Of course, it is possible that some compounds active 
in vitro may not be active in vivo. But, as our predecessor court in Nelson explained, a 'rigorous 
correlation' need not be shown in order to establish practical utility; 'reasonable correlation' 
suffices ." Id. (emphasis added). The Court also rejected Fujikawa's reliance on two articles. 
The Court noted that while one article taught that "in vitro testing is sometimes not a good 
indicator of how potent a compound will be in vivo, it does imply that compounds which are 
active in vitro will normally exhibit some in vivo activity." Id. at 1566. Similarly, the Court 
noted that the second article expressly stated that "[flor most substances, although not for all, the 
relative potency determined in in vitro ... parallels the in vivo activity." Id. 

The Court concluded that the facts in the case were analogous to the ones in Cross where 
the court relied on a knovsoi reasonable correlation between in vitro tests and in vivo activity, and 
therefore affirmed the Board's decision that Wattanasin had established a practical utility with 
the in vitro results. Id, at 1565-66. 

The Nelson, Cross, and Fujikawa cases are very similar to the present case. The 
reasoning of the courts in all three cases that "'[i]t is inherently faster and easier to combat 
illnesses and alleviate symptoms when the medical profession is armed with an arsenal of 
chemicals having known pharmacological activities'" applies to the asserted utility for the 
claimed polypeptides. Fujikawa, 93 F.3d at 1564, quoting Nelson, 626 F.2d at 856; see also 
Cross, 753 F.2d at 1051 ("Successful in vitro testing will marshal resources and direct the 
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expenditure of effort to further in vivo testing of the most potent compounds, thereby providing 
an immediate benefit to the public, analogous to the benefit provided by the showing of an in 
vivo utility.")- Like pharmaceutical compounds, nucleic acids, polypeptides, and antibodies 
which are associated with cancer will make it inherently faster and easier to combat cancer. The 
greater the number of biological markers of cancer medical professionals have access to, the 
more accurate and detailed a diagnosis they can make. The determination that a gene is 
differentially expressed in cancer constitutes at least as significant a development in the field of 
cancer diagnostics as in vitro screening for pharmaceutical activity. See Cross, 753 F.2d at 1051 
("the first link in the screening chain, in vitro testing, may establish a practical utility for the 
compound in question. Successful in vitro testing will marshal resources and direct the 
expenditure of effort to further in vivo testing of the most potent compounds, thereby providing 
an immediate benefit to the public"). 

In addition, like in vitro tests in the pharmaceutical industry, those of skill in the field of 
biotechnology rely on the reasonable correlation that exists between changes in gene expression 
and changes in protein expression (see discussion supra). Were there no reasonable correlation 
between the two, the techniques that measure gene levels such as microarray analysis, differential 
display, and quantitative PGR would not be so widely used by those in the art. See Second 
Grimaldi Declaration at ^ 5. As in Cross, Appellants here do not argue that there is "an 
invariable exact correlation" between gene expression and protein expression. See Cross, 753 
F.2d at 1050. Instead, Appellants' position detailed above is that a measured change in gene 
expression in cancer cells establishes a "significant probability" that the expression of the 
encoded polypeptide in cancer will also be changed based on "a reasonable correlation 
therebetween." M; see also Fujikawa, 93 F.3d at 1565 ("a 'rigorous correlation' need not be 
shown in order to establish practical utility; 'reasonable correlation' suffices"); Nelson, 626 F.2d 
at 857 (holding that "a rigorous correlation is not necessary" and that a "reasonable correlation" 
will suffice). 

Also of importance is the Court's rejection of the notion that any in vitro testing must be 
statistically significant to support a practical utility. Nelson, 626 F.2d at 857. Likewise, 
qualitative characterizations of a test compound as either increasing or decreasing blood pressure 
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was acceptable. Id at 855 (stating that responses were categorized as either a depressor 
(lowering) effect or a pressor (elevating) effect). This is similar to the data in Example 1 8, where 
the change in mRNA levels is described as "more highly expressed." 

There are additional similarities. In Fujikawa, the Board and Court rejected the argument 
that there was no utility because there was no exact correlation between the in vitro and in vivo 
resuhs in spite of supporting testimony and references. Fujikawa, 93 F.3d at 1565-66, Like the 
two references rejected by the Board and Court in Fujikawa, portions of the references of record 
may suggest that the correlation between changes in mRNA levels and protein levels is not exact. 
But like the references in Fujikawa, the majority of each reference supports Appellants' 
assertion, and Appellants have submitted the declaration of two experts in the field which state 
that those skilled in the art rely on the correlation between changes in mRNA and protein. See 
Second Grimaldi Declaration at ^ 5; Polakis Declaration at ^ 6. Thus, as was the case in 
Fujikawa, although there may be some evidence that the correlation relied on is not exact, the 
declarations and numerous references submitted by Appellants are more than enough evidence to 
establish that there is a "reasonable correlation" between changes in mRNA levels and changes in 
the level of the encoded protein. 

In conclusion, Appellants have asserted that the claimed polypeptides are useful for the 
diagnosis of cancer, particularly lung cancer, based on the data in Example 18. This utility is far 
beyond the nebulous expressions "biological activity" or "biological properties" rejected in In re 
Kirk, 376 F.2d 936, 153 U.S.P.Q. 48 (C.C.P.A. 1967). Like Nelson, Cross, and Fujikawa, 
Appellants have asserted a utility which relies on a reasonable correlation between the data 
disclosed in the application and the asserted utility. The fact that there may be limited evidence 
that the correlation is not exact does not invalidate Appellants' showing of utility since the 
correlation need not be a rigorous or exact one. Considering the relevant evidence as a whole. 
Appellants have provided sufficient evidence to establish a reasonable correlation between 
changes in the level of mRNA and corresponding changes in the level of the encoded 
polypeptide. Therefore the claimed polypeptides have a practical utility as diagnostic tools for 
lung cancer. 
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10. Utility - Conclusion 

Appellants' asserted utility for the claimed polypeptides as diagnostic tools for cancer 
corresponds in scope to the subject matter sought to be patented and therefore "must be taken as 
sufficient to satisfy the utility requirement of § 101 for the entire claimed subject." In re Longer, 
503 F,2d 1380, 1391, 183 U.S.P.Q. 288, 297 (C.C.P.A. 1974). The Examiner's unsupported 
arguments and references are not sufficient evidence to make a prima facie showing that "one of 
ordinary skill in the art would reasonably doubt the asserted utility." In re Brana, 51 F.3d 1560, 
1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995). 

And even if the Examiner has established a prima facie case, Appellants have offered 
sufficient rebuttal evidence in the form of expert declarations and references, which, when 
considered as a whole, establish that it is more likely than not that the asserted utility is true. See 
In re Oetiker, 977 F.2d 1443, 1445, 24 U.S.P.Q.2d 1443, 1444 (Fed. Cir. 1992) (stating that the 
evidentiary standard to be used throughout ex parte examination in setting forth a rejection is a 
preponderance of the evidence, or "more likely than not" standard); M.P.E.P. at § 2107.02, part 
VII ("evidence will be sufficient if, considered as a whole, it leads a person of ordinary skill in 
the art to conclude that the asserted utility is more likely than not true .") (emphasis in original). 

Finally, the courts' decisions in similar cases make clear that the evidence provided by 
Appellants is sufficient to establish the asserted utility. The evidence does not need to be direct 
evidence, nor does it need to provide an exact correlation between the submitted evidence and 
the asserted utility. Instead, evidence which is "reasonably" correlated with the asserted utility is 
sufficient. See Fujikawa, 93 F.3d at 1565 ("a 'rigorous correlation' need not be shown in order 
to establish practical utility; 'reasonable correlation' suffices"); Cross, 753 F.2d at 1050 (same); 
Nelson, 626 F.2d at 857 (same). Considering the evidence as a whole in light of the relevant case 
law, the Board should find that Appellants have established at least one specific, substantial, and 
credible utility, and the Examiner's rejection of the pending claims as lacking utility should be 
reversed. 
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C. Enablement Rejection - Detailed Argument 

The second issue before the Board is whether Appellants have enabled the pending claims 
such that one of skill in the art would be able to make and use the claimed invention. The 
Examiner has rejected pending Claims 6-7, 9 and 11-17 under 35 U.S.C. §112, first paragraph, 
arguing that because the claimed invention is not supported by either a specific or substantial 
asserted utility or a well-established utility, one skilled in the art clearly would not know how to 
use the claimed invention. See Office Action at 20. 

In addition, the Examiner has separately rejected pending Claims 12-17 under 35 U.S.C. 
1 12, first paragraph, on the assertion that they contain subject matter which was not described in 
the specification in such a way as to enable one skilled in the art to make and/or use the 
invention. Office Action ai2\ -23, 

As an initial matter, for the reasons discussed above, Appellants maintain that the claimed 
polypeptides possess a specific, substantial and credible utility. Accordingly, Appellants 
maintain that one of skill in the art would know how to use the claimed invention, and to the 
extent that the enablement rejection is based on a lack of utility, it should be reversed. 

In response to the separate enablement rejection of Claims 12-17, Appellants submit that 
Claims 12-17 are enabled such that one of skill in the art could make and use the claimed 
polypeptides without undue experimentation. With respect to Claims 12-13, how to make 
chimeric polypeptides comprising the recited polypeptides of related to SEQ ID NO: 10 and the 
polypeptide encoded by the cDNA deposited under ATCC accession number 209922 is within 
the skill in the art. Similarly, with respect to Claims 14-17, it is well within the skill of those in 
the art to make polypeptides that are at least 95% identical to the recited portions of SEQ ID 
NO: 10 and the polypeptide encoded by ATCC 209922, and it is well within those of skill in the 
art to make antibodies which are specific to a disclosed sequence. See In re Wands, 858 F.2d 731 
(reversing the Board's decision of non-enablement and holding that as of 1980, undue 
experimentation was not required to make high-affinity monoclonal antibodies to a target 
peptide). Thus, one of skill in the art would be able to make the claimed polypeptides without 
undue experimentation. 
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As described above. Appellants assert that the claimed polypeptides are useftil as 
diagnostic tools for cancer, particularly lung cancer. This use is based on the disclosure in 
Example 18 of the instant application that the nucleic acid encoding the PR0874 polypeptide is 
expressed at least two-fold higher in normal lung tissue compared to lung tumor tissue. As 
detailed above, it is well-established that changes in expression levels of mRNA leads to 
corresponding changes in expression levels of the encoded polypeptide, and thus it is likely that 
the PR0874 polypeptide is also differentially expressed in lung tumors. Thus, based on the 
disclosure in the application, one of skill in the art would be able to use the claimed polypeptides 
as diagnostic tools without undue experimentation, for example to make antibodies to distinguish 
suspected lung tumor tissue from normal lung tissue. 

1. Enablement - Lesal Standard 

An application enables the claims "if one skilled in the art, after reading the[] 
disclosure[], could practice the invention claimed ... without undue experimentation." Chiron 
Corp. V. GenentecK Inc., 363 F.3d 1247, 1253 (Fed. Cir. 2004). "But the question of undue 
experimentation is a matter of degree. The fact that some experimentation is necessary does not 
preclude enablement; what is required is that the amount of experimentation 'must not be unduly 
extensive.'" PPG Indus., Inc. v. Guardian Indus., Corp., 75 F.3d 1558, 1564 (Fed. Cir. 1996) 
(quoting Atlas Powder Co. v. EJ. DuPont de Nemours & Co., 750 F.2d 1569, 1576 (Fed. Cir. 
1984)). 

While the application must enable one of ordinary skill in the art to practice the full scope 
of the claimed invention, "[t]hat is not to say that the specification itself must necessarily 
describe how to make and use every possible variant of the claimed invention, for the artisan's 
knowledge of the prior art and routine experimentation can often fill gaps, interpolate between 
embodiments, and perhaps even extrapolate beyond the disclosed embodiments, depending upon 
the predictability of the art." AK Steel Corp. v. Sollac, 344 F.3d 1234, 1244 (Fed. Cir. 2003). 

"Enablement is not precluded by the necessity for some experimentation such as routine 
screening. However, experimentation needed to practice the invention must not be undue 
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experimentation. The key word is 'undue,' not 'experimentation.'" In re Wands 858 F.2d 731, 

736-7, 8 U.S.P.Q.2d 1400, (Fed. Cir. 1988), citations omitted. 

It is equally clear that a rejection based on "lack of utility " whether grounded 
upon 35 U.S.C. 101 or 35 U.S.C. 1 12, first paragraph, rests on the same basis (i.e., 
the asserted utility is not credible) . To avoid confusion, any rejection that is 
imposed on the basis of 35 U.S.C. 101 should be accompanied by a rejection 
based on 35 U.S.C. 112, first paragraph. The 35 U.S.C. 112, first paragraph, 
rejection should be set out as a separate rejection that incorporates by reference 
the factual basis and conclusions set forth in the 35 U.S.C. 101 rejection. The 35 
U.S.C. 112, first paragraph, rejection should indicate that because the invention as 
claimed does not have utility, a person skilled in the art would not be able to use 
the invention as claimed, and as such, the claim is defective under 35 U.S.C. 112, 
first paragraph. A 35 U.S.C. 1 12, first paragraph, rejection should not be imposed 
or maintained unless an appropriate basis exists for imposing a rejection under 35 
U.S.C. 101. In other words. Office personnel should not impose a 35 U.S.C. 1 12, 
first paragraph, rejection grounded on a "lack of utility" basis unless a 35 U.S.C. 
101 rejection is proper . In particular, the factual showing needed to impose a 
rejection under 35 U.S.C. 101 must be provided if a rejecfion under 35 U.S.C. 
1 12, first paragraph, is to be imposed on "lack of utility" grounds. 

To avoid confusion during examination, any rejection under 35 U.S.C. 112, first 
paragraph, based on grounds other than "lack of utility" should be imposed 
separately from any rejection imposed due to "lack of utility" under 35 U.S.C. 101 
and 35 U.S.C. 1 12, first paragraph. M.P.KP. § 2107.01 IV (emphasis added). 

2. Enablement - Burden of Proof 

In order to make an enablement rejection, the PTO has the initial burden to establish a 
reasonable basis to question the enablement provided for the claimed invention. See M,P,E.P, § 
2164.04. A specification teaching how to make and use the claimed subject matter must be taken 
as being in compliance with the enablement requirement unless there is a reason to doubt the 
objective truth of the statements contained therein which are relied on for enabling support. Id, 
It is incumbent for the PTO "to explain why it doubts the truth or accuracy of any statement in a 
supporting disclosure and to back up assertions of its own with acceptable evidence or reasoning 
which is inconsistent with the contested statement." Id. (quoting In re Marzocchi, 439 F.2d 220, 
224, 169 U.S.P.Q. 367, 370 (C.C.P.A. 1971). This can be done "by making specific findings of 
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fact, supported by the evidence, and then drawing conclusions based on these findings of fact.'' 

Id. 

3. Enablement - Standard of Proof 

Once the examiner has weighed all the evidence and established a reasonable basis to 
question the enablement provided for the claimed invention, the burden falls on the applicant to 
present persuasive arguments, supported by suitable proofs where necessary, that one skilled in 
the art would be able to make and use the claimed invention using the application as a guide. See 
M.P.E.P. § 2164.05. "The evidence provided by applicant need not be conclusive but merely 
convincing to one skilled in the art." Id. (bold emphasis added, underline in original). "A 
declaration or affidavit is, itself, evidence that must be considered ." Id. (emphasis in original). 

The examiner must then "weigh all the evidence before him or her, including the 
specification and any new evidence supplied by applicant with evidence and/or sound scientific 
reasoning previously presented in the rejection and decide whether the claimed invention is 
enabled." Id, "The examiner should never make the determination based on personal opinion. 
The determinafion should always be based on the weight of all the evidence." Id. (emphasis in 
original). 

4. Appellants^ Specification Teaches How to Make and Use the Claimed Subject 
Matter 

The specification enables one skilled in the art to make and use the fiill scope of the 
claims without undue experimentation. The claimed subject matter relates to polypeptides of 
SEQ ID NO: 10 and the polypeptide encoded by ATCC deposit 209922, and polypeptides which 
are at least 95% identical to those polypeptides and which can be used to make antibodies that 
specifically detect PR0874 in lung tissue. 

The specification discloses how to make the claimed polypeptides, for example in 
paragraphs [0283]-[0315] and Examples 6-9 (HH [0453]-[0492]). In addition, methods for 
making polypeptides which are at least 95% identical to SEQ ID NO: 10 by making substitutions 
or deletions are also disclosed in the specification and were well known in the art. See e.g., 
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Specification at paragraphs [0256]-[0271]. Methods for making and testing antibodies for 
specificity were well known in the art, and are disclosed in the specification, including 
paragraphs [0361]-[0379] and Example 10 [0493]-[0499]), which specifically describes the 
preparation of antibodies that bind PRO polypeptides. In addition, the specification discloses 
that antibodies to the claimed polypeptides can be used in diagnostic assays to detect the 
expression of PR0874 in specific types of tissue. See e,g,, Specification at [0407]. 

In light of the differential expression of the nucleic acid encoding the PR0874 
polypeptide in lung tumor tissues compared to normal lung tissue, one of skill in the art would 
expect the PR0874 polypeptide to be differentially expressed in these tumors as well. Therefore, 
given the teaching in the specification on how to make and use the claimed polypeptides to detect 
expression of PR0874 in specific tissues, one of skill in the art would be enabled to practice the 
claimed invention without undue experimentation. 

Because Appellants' specification teaches how to make and use the claimed subject 
matter, it must be taken as being in compliance with the enablement requirement unless there is a 
reason to doubt the objective truth of the statements contained therein which are relied on for 
enabling support. See M.P.E.P. §2164.04. 

5. The Examiner's Arguments Fail to Establish a Reasonable Basis to Question 
the Enablement Provided for the Claimed Invention in the Specification 

The PTO has the initial burden to establish a reasonable basis to question the enablement 
provided for the claimed invention. See M.P.E.P. § 2164.04. It is incumbent for the PTO "to 
explain why it doubts the truth or accuracy of any statement in a supporting disclosure and to 
back up assertions of its own with acceptable evidence or reasoning which is inconsistent with 
the contested statement." Id (quoting In re Marzocchi, 439 F,2d 220, 224, 169 U.S.P.Q. 367, 
370 (C.C.P.A. 1971). This can be done "by making specific findings of fact, supported by the 
evidence, and then drawing conclusions based on these findings of fact." Id. 

First, the Examiner has rejected pending Claims 6-7, 9 and 11-17 as not being enabled 
because the claimed invention is not supported by either a specific or substantial asserted utility 
or a well-established utility. See Office Action at 20. For the reasons discussed above, the 
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claimed polypeptides possess a specific, substantial and credible utility, and to the extent that the 
enablement rejection is based on a lack of utility, it should be reversed. 

Second, the Examiner has separately rejected pending Claims 12-17 under 35 U.S.C. 1 12, 
first paragraph, on the assertion that they contain subject matter which was not described in the 
specification in such a way as to enable one skilled in the art to make and/or use the invention. 
Office Action at 21-23. 

The Examiner makes several arguments in rejecting claims which recite the limitation 
"wherein said isolated polypeptide is more highly expressed in normal lung tissue compared to 
lung tumor tissue, or wherein said isolated polypeptide is encoded by a polynucleotide that is 
more highly expressed in normal lung tissue compared to lung tumor tissue." Specifically, the 
Examiner states that the limitation provides no information "regarding the level of expression, 
activity, or role in cancer of the PR0874 polypeptide" Office Action at 21. The Examiner 
repeats his arguments from the utility rejection, stating that he has cited countervailing evidence 
that protein levels are not always consistent with mRNA levels. Id, at 21-22. 

While Appellants do not accept these arguments, Claims 4 and 5 which recited the 
limitations at issue have been canceled, and Claim 12, which depended from Claim 4, has been 
amended to depend fi-om Claim 6. Claims 12-13 as dependent from Claim 6 do not recite 
percent amino acid sequence identity as a limitation, nor do they recite any limitation regarding 
overexpression in lung tumors. These claims are directed to fusion peptides of the disclosed 
sequence. Therefore, the amendments to the pending claims have rendered these arguments 
moot. 

With respect to Claims 14-17, which recite the limitation "wherein said isolated 
polypeptide or a fragment thereof can be used to generate an antibody which can be used to 

specifically detect the polypeptide of SEQ ID NO: 10 in lung tissue samples," the Examiner states 
that "[tjhese claims encompass any and all antigenically cross-reactive polypeptides possessing 
the recited percent identity, regardless of their biological activity." Id. at 22, The Examiner 
continues, arguing: 

If mere antigenic cross-reactivity were the test for enablement under § 112, 
Applicants could obtain patent rights that may confer power to block off whole 
areas of scientific development related to the biological activity of the 
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polypeptide, for which Applicants have not provided any disclosure. It is entirely 
unclear why the disclosure of a single polypeptide, i.e., PR0874 (SEQ ID NO: 10), 
which is ideally suited to the making of antibodies to itself, would enable any and 
all antigenically cross-reactive polypeptides possessing the recited percent identity 
and possessing unknown and undisclosed biological activities, when the 
specification provides no disclosure of any biological activity. Therefore, the 
scope of enablement provided to the skilled artisan by the disclosure is not 
commensurate with the scope of protection sought by the claims. Id. at 22-23. 

The standard for determining whether the specification meets the enablement requirement 
is to be evaluated based on whether or not the experimentation needed for one skilled in the art to 
practice the invention would be undue. Mineral Separation v. Hyde, 242 U.S. 261, 270 (1916); 
In re Wands, 858 F.2d 731, 737, 8 U.S.P.Q.2d 1400, 1404 (Fed. Cir. 1988); M.P.E.P. § 2164.01. 
Appellants submit that in view of the requirements of enablement under 35 U.S.C. §112, first 
paragraph, the Examiner has failed to establish a prima facie basis for rejecting Claims 14-17 as 
lacking enablement. The Examiner's unsupported conclusory statements fail to establish a 
reasonable basis to question the enablement provided for the claimed invention. See M,P.E,P. § 
2164.04. 

It is incumbent for the PTO "to explain why it doubts the truth or accuracy of any 
statement in a supporting disclosure and to back up assertions of its own with acceptable 
evidence or reasoning which is inconsistent with the contested statement." Id. (quoting In re 
Marzocchi, 439 F.2d 220, 224, 169 U.S.P.Q. 367, 370 (C.C.P.A. 1971). This can be done "by 
making specific findings of fact, su pported by the evidence , and then drawing conclusions based 
on these findings of fact." Id, The Examiner has failed to make any specific findings of fact, or 
back up his assertions with any acceptable evidence or reasoning. 

In the present case, the Examiner reasons that "[i]f mere antigenic cross-reactivity were 
the test for enablement under § 1 12, Applicants could obtain patent rights that may confer power 
to block of f whole areas of scientific development related to the biological activity of the 
polypeptide, for which Applicants have not provided any disclosure." This is not the test for 
enablement. 

The M.P.E.P. states that "if any use is enabled when multiple uses are disclosed, the 
application is enabling for the claimed invention." M.P.E.P, § 2164.01(c) (emphasis added). As 
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described above, the specification adequately discloses how to make and use the polypeptides of 
Claims 14-17. The Examiner has not alleged that undue experimentation would be required to 
practice the claimed invention - the proper standard for lack of enablement. Accordingly, it 
remains unquestioned that the claimed polypeptides have an enabled use. 

Asserting that "[i]t is entirely unclear why the disclosure of a single polypeptide, ... 
would enable any and all antigenically cross-reactive polypeptides possessing the recited percent 
identity and possessing unknown and undisclosed biological activities, when the specification 
provides no disclosure of any biological activity," is also not a reason to reject the claimed 
polypeptides as lacking enablement. The subject matter of Claims 14-17 relates to isolated 
polypeptides with at least 95% identity to the disclosed polypeptides wherein the claimed 
polypeptides can be used to generate an antibody which can be used to specifically detect the 
polypeptide of SEQ ID NO: 10 in lung tissue samples. The Examiner has not offered any 
explanation of how the failure to disclose "any biological activity" results in one of skill in the art 
having to resort to undue experimentation to practice the claimed invention. Disclosure of a 
"biological activity" is not required for one of skill in the art to either make or use the claimed 
polypeptides. 

Further, "[t]he presence of only one example should never be the sole reason for rejecting 
claims as being broader than the enabling disclosure, even though it is a factor to be considered 
along with all the other factors. To make a valid rejection, one must evaluate all the facts and 
evidence and state why one would not expect to be able to extrapolate that one example across 
the entire scope of the claims." M,P,E.P. § 2164.02. The Examiner provides no other basis for 
rejection of the Claims 14-17 aside from pointing to "disclosure of a single polypeptide," 
Accordingly, pointing to the disclosure of a single polypeptide, absent any other evidence, cannot 
support prima facia rejection of lack of enablement of the claimed polypeptides. 

Notwithstanding the failure of the Examiner to provide sufficient evidence to support a 
prima facie rejection of Claims 14-17, as explained above, the specification teaches in detail how 
to make the claimed polypeptides, including variants thereof, and antibodies which specifically 
bind PR0874. Likewise, as detailed above, the specification provides sufficient guidance as to 
how to use the claimed polypeptides. Thus, there is significant guidance how to make and use 
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the claimed polypeptides. In addition, as the disclosure and references cited in the specification 
make clear, the production of polypeptides, polypeptide variants, and specific antibodies is a 
predictable and well established aspect of the biological sciences. See, e.g., In re Wands, 858 
F.2d 731, 8 U.S.P.Q. 2d 1400 (Fed. Cir. 1988) (reversing the Board's decision of non- 
enablement and holding that as of 1980, undue experimentation was not required to make high- 
affinity monoclonal antibodies to a target peptide). 

In conclusion, the Examiner has failed to meet his burden to establish a reasonable basis 
to question the enablement provided for the claimed invention - conclusory statements are 
simply not sufficient. 

6. Appellants' Response and Grouvins of Rejected Claim - Rejected Claims 6-7, 9 

and 11-1 7 are Enabled 
Appellants submit that for the reasons stated above, the rejected claims are enabled by the 
specification as filed. The Examiner has failed to "to explain why [he] doubts the truth or 
accuracy of any statement in a supporting disclosure and to back up assertions of [his] own with 
acceptable evidence or reasoning which is inconsistent with the contested statement." M.P.E.P. 
§ 2164.04. Instead, the Examiner impermissibly relies on his own personal opinion that "it is 
unclear" how the disclosure enables the claimed subject matter. Id. ("The examiner should 
never make the determination [of whether a claim is enabled] based on personal opinion . The 
determination should always be based on the weight of all the evidence.") (some emphasis 
supplied). 

For purposes of the enablement rejection, Claims 6-7, 9 and 11-13 can be considered as a 
group, Claims 14 and 16-17 can be considered as a group, and Claim 15 should be considered 
individually. 

a. Claims 6-7, 9 and 11-13 are enabled 
Claims 6-7, 9 and 1 1-13 are enabled for the reasons discussed above. The scope of these 
claims is narrow, and because the sequences of SEQ ID NO: 10 and ATCC deposit 209922 are 
disclosed in the specification, no experimentation of any kind is required to make the claimed 
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polypeptides. One of skill in the art would clearly be able to use these polypeptides to make 
antibodies which are specific to these polypeptides, such that the level of expression of these 
polypeptides could be assessed in lung tissue. The only question is whether the use of these 
polypeptides to make antibodies to detect their expression level is a substantial and specific 
utility. For the reasons discussed at length above, Appellants believe that differential expression 
of the PR0874 mRNA in lung tumors provides the claimed polypeptides with a substantial and 
specific utility. Therefore, Claims 6-7, 9 and 11-13 are enabled. 

b. Claims 14 and 16-1 7 are enabled 

Claims 14 and 16-17 are enabled for the reasons discussed above. Because the longest 
recited portion of SEQ ID NO: 10 is only 288 amino acids long, a polypeptide which is at least 
95% identical can only have approximately 14 deletions or substitutions. While some 
experimentation will be required to make these polypeptides that are not identical to the recited 
portions of SEQ ID NO: 10, any such experimentation is routine in the art and will not be undue. 
One of skill in the art would clearly be able to use these polypeptides to make antibodies which 
are specific to SEQ ID NO: 10, such that the expression level of PR0874 can be assessed in lung 
tissue. The only question is whether the use of these polypeptides to make antibodies to detect 
the expression level of PR0874 is a substantial and specific utility. For the reasons discussed at 
length above, Appellants believe that it is, and therefore, Claims 14 and 16-17 are enabled. 

c. Claim 15 is enabled 

Claim 15 is enabled for the reasons discussed above. The scope of this claim is narrower 
than that of Claim 14 since a polypeptide which is at least 99% identical can only have 
approximately 3 deletions or substitutions. As a result, less experimentation will be required to 
make these polypeptides, although any experimentation remains routine. One of skill in the art 
would clearly be able to use these polypeptides to make antibodies which are specific to SEQ ID 
NO: 10, and for the reasons discussed above, Appellants believe that such use is substantial and 
specific. Therefore, Claim 15 is enabled. 
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7. Enablement - Conclusion 

For the reasons discussed above, the specification enables the scope of the claimed 
invention such that one skilled in the art could make and use the claimed invention without 
undue experimentation. The Examiner has offered only conclusory statements, and has failed to 
back up his assertions "with acceptable evidence or reasoning which is inconsistent with the 
contested statement [of enablement]." Id (quoting In re Marzocchi, 439 F.2d 220, 224, 169 
U.S.P.Q. 367, 370 (C.C.P.A. 1971). Therefore, the Examiner has failed to meet his initial burden 
to establish a reasonable basis to question the enablement provided for the claimed invention. 
SeeM,P.KP. §2164.04. 

And even if the Examiner has met his burden. Appellants have presented persuasive 
arguments, supported by the evidence discussed above with respect to utility, that one skilled in 
the art would be able to make and use the claimed invention using the application as a guide. 
Appellants remind the Board that "[t]he evidence provided by applicant need not be conclusive 
but merely convincing to one skilled in the art." M.P.E.P. § 2164.05 (emphasis in original). 

Appellants submit that rejected Claims 6-7, 9 and 11-13 should be considered as a group, 
Claims 14 and 16-17 should be considered as a group, and Claim 15 should be considered 
individually. This is because the scope of the subject matter in each of the groups differs, and 
therefore varying amounts of experimentation will be required to make and use each of the 
groups. However, Appellants submit that any experimentation would be routine for those of skill 
in the art. 

Considering all of the evidence provided by the Appellants to establish their asserted 
utility, along with the disclosure in the specification, the Board should find that Appellants have 
established that one of skill in the art would be able to make and use the claimed invention 
without undue experimentation, and the Examiner's rejection of the pending claims as lacking an 
enabling disclosure should be reversed. 

D, Written Description Rejection - Detailed Argument 

The third issue before the Board is whether the claimed subject matter is described in the 
specification in such a way as to reasonably convey to one skilled in the art that the inventors had 
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possession of the claimed invention at the time the application was filed. The Examiner has 
rejected pending Claims 12-17 under 35 U.S.C. §112, first paragraph, as lacking an adequate 
written description. Office Action at 20. 

The Examiner has failed to meet his initial burden of rebutting the presumption that the 
written description is adequate because the Examiner's only arguments directed to the Claims 12- 
17 are the same unsupported conclusory arguments made to support the enablement rejection of 
the pending claims. For the reasons detailed below, Appellants submit that Claims 12-17 are 
adequately described such that one of skill in the art would recognize that the inventors had 
possession of the claimed invention at the time the application was filed. 

1. Written Description - Lesal Standard 

The well-established test for sufficiency of support under the written description 

requirement of 35 U.S.C. §112, first paragraph is stated by the Court in Vas-Cath, Inc. v. 

Mahurkar, 935 F.2d 1555, 19 U.S.P.Q.2d 1111 (Fed. Cir. 1991): 

"Although [the applicant] does not have to describe exactly the subject matter 
claimed, ... the description must clearly allow persons of ordinary skill in the art 
to recognize that [he or she] invented what is claimed." "The test for sufficiency 
of support in a parent application is whether the disclosure of the application 
relied upon 'reasonably conveys to the artisan that the inventor had possession at 
that time of the later claimed subject matter.'" Vas-Cath Inc, v. Mahurkar, 935 
F.2d at 1562-63, 19 U.S.P.Q.2d at 1 1 16 (citations omitted) 

2. Written Description - Burden of Proof 

The M.P.E.P. states that "[a] description as filed is presumed to be adequate , unless or 
until sufficient evidence or reasoning to the contrary has been presented by the examiner to rebut 
the presumption. See, e.g., In re Marzocchi, 439 F.2d 220, 224, 169 U.S.P.Q. 367, 370 
(C.C.P.A. 1971)." M,P.KP, § 2163.04 (emphasis added). Therefore "[t]he examiner has the 
initial burden of presenting by a preponderance of evidence why a person skilled in the art would 
not recognize in an applicant's disclosure a description of the invention defined by the claims. 
Wertheim, 541 F.2d at 263, 191 U.S.P.Q. at 97." Id, Only then does the Applicant need to 
respond to the Examiner's arguments. 
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3. Written Description - Standard of Proof 

The adequacy of written description support is a factual issue and is to be determined on a 
case-by-case basis. See e.g., Vas-Cath, Inc. v. Mahurkar, 935 F.2d at 1563, 19 U.S.P.Q.2d at 
1116 (Fed. Cir. 1991) (emphasis added). The factual determination in a written description 
analysis depends on the nature of the invention and the amount of knowledge imparted to those 
skilled in the art by the disclosure. Union Oil v. Atlantic Richfield Co., 208 F.3d 989, 996 (Fed. 
Cir. 2000). As with the utility requirement, the evidentiary standard to be used throughout ex 
parte examination in setting forth a rejection is a preponderance of the evidence, or '' more likely 
than not " standard. In re Oetiker, 977 F.2d 1443, 1445, 24 U.S.P.Q.2d 1443, 1444 (Fed. Cir. 
1992). 

4. The Examiner's Arguments Fail to Overcome the Presumption that the 
Claimed Invention is Adequately Described in the Specification 

To overcome the presumption that the claimed subject matter is adequately described, the 
Examiner must present "by a preponderance of evidence why a person skilled in the art would 
not recognize in an applicant's disclosure a description of the invention defined by the claims." 
M.P.E.P. § 2163.04 (emphasis added). To support his rejection of pending Claims 12-17, the 
Examiner has merely repeated, nearly verbatim, the same unsupported conclusory arguments 
made in support of his enablement rejection. 

In response to claims reciting the limitation "wherein said isolated polypeptide is more 
highly expressed in normal lung tissue compared to lung tumor tissue,..." the Examiner repeats 
his assertion that no information is provided "regarding the level of expression, activity, or role 
in cancer of the PR0874 polypeptide." Office Action at 23. The Examiner also repeats his 
arguments from the utility rejection, stating that he has cited countervailing evidence that protein 
levels are not always consistent with mRNA levels. Id. at 23-24. 

While Appellants do not agree with these assertions, Claims 4 and 5 which recited the 
limitations at issue have been canceled, and Claim 12, which depended fi-om Claim 4, has been 
amended to depend from Claim 6. Claims 12-13 as dependent from Claim 6 do not recite 



-73- 



Appl. No. : 10/063,514 

Filed : May 1, 2002 

percent amino acid sequence identity as a limitation, nor do they recite any limitation regarding 

overexpression in lung tumors. These claims are directed to fusion peptides of the disclosed 

sequence. Therefore, the amendments to the pending claims have rendered these arguments 

moot. In the absence of any other arguments as to "why one of skill in the art would not 

recognize in an applicant's disclosure a description of the invention defined by the claims," the 

Examiner has failed to rebut the presumption that the specification satisfies the written 

description requirement for Claims 12-13. See M.P.E.P. § 2163.04. 

With respect to Claims 14-17, which recite the limitation "wherein said isolated 

polypeptide or a fragment thereof can be used to generate an antibody which can be used to 

specifically detect the polypeptide of SEQ ID NO: 10 in lung tissue samples," the Examiner 

repeats his assertion that "[t]hese claims encompass any and all antigenically cross-reactive 

polypeptides possessing the recited percent identity, regardless of their biological activity." Id at 

24. The Examiner continues, arguing: 

Applicants have not described the biologic activity of the PR0874 polypeptide or 
any of its variants. It is entirely unclear why the disclosure of a single 
polypeptide, i.e., PR0874, which is ideally suited to the making of antibodies to 
itself, would describe any and all antigenically cross-reactive polypeptides 
possessing the recited percent identity and possessing unknown and undisclosed 
biological activities, when the specification does not describe any biological 
activity. Id 

The Examiner concludes that as a result, the claimed subject matter is not adequately described. 
Id 

As noted above, "[a] description as filed is presumed to be adequate , unless or until 
sufficient evidence or reasoning to the contrary has been presented by the examiner to rebut the 
presumption." M.P.E.P, § 2163.04 (emphasis added). Therefore "[t]he examiner has the initial 
burden of presenting by a preponderance of evidence why a person skilled in the art would not 
recognize in an applicant's disclosure a description of the invention defined by the claims." Id. 

The Examiner has not provided miy reasoning or evidence as to how the absence of the 
disclosure of "biological activity" results in an inadequate description of the subject matter of 
Claims 14-17. The claimed subject matter relates to polypeptides that have at least 95% 
sequence identity to the recited portions of SEQ ID NO: 10, and can be used to generate an 
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antibody which can be used to specifically detect the polypeptide of SEQ ID NO: 10 in lung tissue 
samples. Appellants fail to see how any "biological activity" of the claimed polypeptides, aside 
from being able to generate an antibody which can specifically detect the polypeptide of SEQ ID 
NO: 10, is at all relevant to an adequate description of the claimed polypeptides which are not 
claimed on the basis of any "biological activity." In the absence of any other arguments as to 
why one of skill in the art would not recognize a description of the claimed invention in 
Appellants' disclosure, the Examiner has failed to rebut the presumption that the specification 
satisfies the written description requirement for Claims 14-17. See M.P.E.P. § 2163.04. 

5. Appellants^ Response and Groupins of Claims - Rejected Claims 12-17 are 

Adequately Described 
The adequacy of written description support is a factual issue and is to be determined on a 

case-by-case basis . See e,g., Vas-Catk Inc. v. Mahurkar, 935 F.2d at 1563, 19 U.S.P.Q.2d at 
1116 (Fed. Cir. 1991) (emphasis added). The factual determination in a written description 
analysis depends on the nature of the invention and the amount of knowledge imparted to those 
skilled in the art by the disclosure. Union Oil v. Atlantic Richfield Co., 208 F.3d 989, 996 (Fed. 
Cir. 2000). 

As discussed above, the Examiner must address each of the pending claims on a case-by- 
case basis. This is because the genuses encompassed by the claims differ, and therefore whether 
or not the specification supports the claimed genus depends on the claim at issue. Appellants 
hereby request that the Board consider the following groupings of Claims 12-17 with respect to 
the written description requirement. 

a. Rejected Claims 12-13 are adequately described 

Claims 12-13, which depend from Claim 6, are adequately described by the specification. 
Claim 6 is directed to an isolated polypeptide comprising the amino acid sequence of the recited 
portions of the polypeptide of SEQ ID NO: 10 or the amino acid sequence of the polypeptide 
encoded by the cDNA deposited under ATCC accession number 209922. 
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As stated above, the Examiner provides no basis for rejecting either of Claims 12-13 
because the Examiner's arguments are directed at claims reciting the limitation "wherein said 
isolated polypeptide is encoded by a polynucleotide that is more highly expressed in normal lung 
tissue compared to lung tumor tissue,..." See Office Action at 23. As amended, Claims 12-13 
depend from Claim 6, which does not recite the objected to limitation. 

Regardless of any lack of reasoning provided by the Examiner, Appellants assert that 
each recited element of Claim 6 is explicitly disclosed in the specification, either in writing {see, 
e.g.. Specification at Figure 10) or by virtue of a biological deposit. Accordingly, there can be no 
basis for holding that Claim 6 is not adequately described. Likewise, Claims 12-13, which are 
drawn to chimeric polypeptides comprising the polypeptide of Claim 6, are also fully described 
by the specification. 

The Examiner does not contest the written description support for any embodiment 
recited in Claims 12-13. Therefore the Examiner has failed to meet his "initial burden of 
presenting by a preponderance of evidence why a person skilled in the art would not recognize in 
an applicant's disclosure a description of the invention defined by the claims." M.P.E.P. § 
2163.04. As such, the Board should reverse the Examiner's rejection of Claims 12-13 under 35 
U.S.C. § 1 12, first paragraph, for lack of written description. 

b. Rejected Claim 14, 16 and 1 7 are adeguately described 
Claims 14, 16 and 17 are adequately described by the specification. Claim 14 is directed 
to an isolated polypeptide having at least 95% amino acid sequence identity to the amino acid 
sequence of the recited portions of SEQ ID NO: 10 or the amino acid sequence of the polypeptide 
encoded by the cDNA deposited under ATCC accession number 209922; wherein said isolated 
polypeptide or a fragment thereof can be used to generate an antibody which can be used to 
specifically detect the polypeptide of SEQ ID NO: 10 in lung tissue samples. Claims 16 and 17 
ultimately depend from Claim 14. 

Appellants maintain that there is no substantial variation within the species which fall 
within the scope of the rejected claims, which require at least 95% amino acid sequence identity 
to the recited portions of SEQ ID NO: 10 and can be used to generate antibodies which 
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specifically detect the polypeptide of SEQ ID NO: 10 in lung tissue samples. As such, Appellants 
were in possession of the common attributes or features of the claimed subject matter. 

The rejected claims are analogous to the claims discussed in Example 14 of the written 
description training materials available on the PTO's website. In Example 14, the written 
description requirement was found to be satisfied for claims directed to polypeptides with 95% 
homology to a disclosed sequence that also possess a recited catalytic activity, where procedures 
for making variant proteins were routine in the art and the specification provided an assay for 
detecting the recited catalytic activity of the protein. This disclosure satisfies the written 
description requirement even though the a pplicant had disclosed only a single species and had 
not made any variants . The Guidelines state that "[t]he single species disclosed is representative 
of the genus because all members have at least 95% structural identity with the reference 
compound and because of the presence of an assay which applicant provided for identifying all of 
the at least 95% identical variants of SEQ ID NO: 3 which are capable of the specified catalytic 
activity ." 

Similarly, the pending claims also have at least 95% sequence identity to the disclosed 
sequence, and must be able to generate antibodies which specifically detect the polypeptide of 
SEQ ID NO: 10 in lung tissue samples. As in Example 14, at the time of the effective filing date 
of the instant application, it was well known in the art how to make polypeptides with at least 
95% amino acid sequence identity to the disclosed sequences. See, e.g., Specification at 
Till [0256]-[0271]. In addition, the specification discloses in detail how to make antibodies which 
specifically detect a particular PRO polypeptide, and how to use them to detect the PRO 
polypeptide in a particular tissue. See, e.g.. Specification [0363]-[0379], [0407], and [0493]- 
[0499]. Like a particular catalytic activity, the function of being useful to produce an antibody 
specific to SEQ ID NO: 10 is directly related to the structure of the claimed polypeptides. Thus, 
like Example 14, the genus of polypeptides that have at least 95% amino acid sequence identity 
to the disclosed sequences and possess the described functional activity are adequately described. 

Claims 16 and 17, drawn to particular embodiments of Claim 14, are also fiilly described 
by the specification. The Examiner does not contest the written description support for any 
embodiment recited in Claims 16-17. 
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The Examiner asserts that the present claims are not analogous to those discussed in 
Example 14 of the written description guidelines because the specification does not describe any 
biological activity of the claimed polypeptides and because the claims are not limited to any 
specific "biological activity" of the claimed polypeptides. Office Action at 24-25. 

Appellants submit that the applicability of Example 14 is not limited to polypeptides for 
which the biological function is known and recited, but extends to all situations where the 
polypeptide is useful and there is no substantial variation wdthin the species encompassed by the 
claims. The purpose of the recited catalytic activity in the example is to limit the amount of 
structural variation within the species. The commentary in the Guidelines states that the 
description of an assay to detect variants which have the recited activity, along with 95% 
homology, is sufficient to satisfy the written description requirement. 

Similarly, in the instant case. Claims 14-17 must share a particular "biological activity" 
which restricts the amount of permissible structural variation within the species - the claimed 
polypeptides must be useable to generate an antibody which can be used to specifically detect the 
polypeptide of SEQ ID NO: 10 in lung tissue samples. This limitation combined with the 
disclosure of how to make and test the recited antibodies generated from the claimed 
polypeptides, along with the requirement of least 95% amino acid sequence identity, results in 
claimed subject matter where there is no substantial variation within the species encompassed by 
the claims. Accordingly, Appellants maintain that the pending claims are analogous to the 
claims in Example 14. 

As for the Examiner's conclusory, and unsupported statement that "[i]t is entirely unclear 
why the disclosure of a single polypeptide, i.e., PR0874, ...would describe any and all 
antigenically cross-reactive polypeptides possessing the recited percent identity," the basic 
premise that a large genus can not be adequately described by a single species is simply wrong. 
In a recent Federal Circuit decision. In re Wallach, 378 F.3d 1330, 1333-34 (Fed. Cir. 2004), the 
Court stated: 

[W]e agree with Appellants that the state of the art has developed such that the 
complete amino acid sequence of a protein may put one in possession of the Rcnus 
of DNA sequences encoding it , and that one of ordinary skill in the art at the time 
the '129 application was filed may have therefore been in possession of the entire 
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genus of DNA sequences that can encode the disclosed partial protein sequence, 
even if individual species within that genus miRht not have been described or 
rendered obvious. ... A claim to the genus of DNA molecules complementary to 
the RNA having the sequences encompassed by that formula, even if defined only 
in terms of the protein sequence that the DNA molecules encode, while containing 
a large number of species, is definite in scope and provides the public notice 
required of patent applicants . 

Moreover, we see no reason to require a patent applicant to list every possible 
permutation of the nucleic acid sequences that can encode a particular protein for 
which the amino acid sequence is disclosed, given the fact that it is, as explained 
above, a routine matter to convert back and forth between an amino acid sequence 
and the sequences of the nucleic acid molecules that can encode it. Id. (emphasis 
added). 

The Court did not require the applicants in Wallach to actually make or individually 
describe all of the vast number of sequences which encode the disclosed protein sequence. This 
is in spite of the fact that only a single protein sequence was disclosed, and the encompassed 
genus was enormous due to codon degeneracy in the genetic code - even the most skilled artisan 
could not individually envision the detailed chemical structure of the nucleic acids encompassed 
by the claimed genus. The Court reasoned that because it is routine to convert between amino 
acid sequences and nucleic acid sequences, disclosure of a single amino acid sequence was 
sufficient to place the applicants in possession of the enormous genus of nucleic acids which 
could encode the sequence. 

The facts in Wallach are very similar to the instant case. Here, Appellants have disclosed 
SEQ ID NO: 10, and claim polypeptides which are at least 95% identical to portions of it and 
have the functional limitation of the ability to generate antibodies which can be used to 
specifically detect SEQ ID NO: 10 in lung tissue samples. As discussed above, it is routine in the 
art to create polypeptides which have at least 95% sequence identity to SEQ ID NO: 10 - it is just 
as predictable and easy as creating all of the nucleic acids which encode a particular amino acid 
sequence. Similarly, it is well within the knowledge of those skilled in the art how to determine 
which polypeptides can be used to make the recited antibodies. The predictability of this 
structure/function combination is sufficient to place the claimed subject matter in the possession 
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of the Appellants, and thus the claimed polypeptides are adequately described. The Wallach 
opinion makes clear that there is no need to literally describe more than a single species to 
adequately describe a large genus where one of skill in the art recognizes that the disclosed 
species puts the applicant in possession of the claimed genus. 

In conclusion, the Examiner has failed to meet his "initial burden of presenting by a 
preponderance of evidence why a person skilled in the art would not recognize in an applicant's 
disclosure a description of the invention defined by the claims." M.P.E.P. § 2163.04. And even 
if he has met this burden, Appellants responded with arguments and evidence which are 
sufficient to overcome the Examiner's conclusory statements and unsupported opinions. As 
such, the Board should reverse the Examiner's rejection of Claims 14 and 16-17 under 35 U.S.C. 
§112, first paragraph, for lack of written description. 

c. Rejected Claim 15 is adequately described 
For the reasons discussed above regarding Claims 14 and 16-17, Appellants believe that 
Claim 15 is also adequately described. However, because the longest recited portion of SEQ ID 
NO: 10 is 288 amino acids long, a polypeptide which is at least 99% identical can only have 
approximately 3 deletions or substitutions. As a result, the genus of polypeptides encompassed 
by Claim 15 is smaller than that of Claim 14, and the Board should consider the adequacy of the 
written description for this claim independently of the other claims. Given the lack of any 
substantial arguments by the Examiner to support his rejection, as well as his failure to address 
Claim 15 directly, the Board should reverse the Examiner's rejection of Claim 15 under 35 
U.S.C. § 1 12, first paragraph, for lack of written description. 

6. Written Description - Conclusion 

In conclusion, the Board should reverse the Examiner's written description rejection of 
Claims 12-17 because the Examiner has failed to rebut the presumption that the claims are 
adequately described, as he has failed to provide any evidence or reasoning to support the 
rejection: "[a] description as filed is presumed to be adequate , unless or until sufficient evidence 



-80- 



Appl. No. 
Filed 



10/063,514 
May 1, 2002 



or reasoning to the contrary has been presented by the examiner to rebut the presumption." 
M,P,KP, § 2163.04 (emphasis added). 

And even if the Examiner's arguments are sufficient to rebut the presumption, Appellants 
submit that they have satisfied the written description requirement for rejected Claims 12-17 
based on the actual reduction to practice of SEQ ID NO: 10, by specifying a high level of amino 
acid sequence identity, and by describing how to make and use antibodies to the disclosed 
sequence. These facts are directly analogous to those of Example 14 of the Written Description 
Guidelines published by the PTO. In addition, like In re Wallach, the description of the single 
species SEQ ID NO: 10 is sufficient to place the Appellants' in possession of the claimed genus 
because those of skill in the art recognize the correlation between polypeptide structure and the 
ability to generate specific antibodies. Appellants submit that the instant disclosure allows one 
of skill in the art to "recognize that the applicant was in possession of the necessary common 
attributes or features of the elements possessed by the members of the genus." Hence, 
Appellants respectfully request that the Board reverse the Examiner's written description 
rejection of Claims 12-17 under 35 U.S.C. §112, first paragraph. 

E. New Matter Rejection - Detailed Arguments 

The final issue before the Board is whether amendments to the claims are properly 
supported by the specification as filed. The Examiner has rejected pending Claims 6-7, 9 and 1 1- 
17 under 35 U.S.C. §112, first paragraph, as containing new matter. Office Action at 27. 

With respect to amino acid species starting at methionine residue #34, the Examiner 
argues that there is no "express, implicit, or inherent support for this species to the exclusion of 
all other species." Id. at 25. With respect to the limitations reciting residues 81-109 and 232-253 
of SEQ ID NO: 10, the Examiner argues that "the specification does not disclose a fragment of a 
PRO polypeptide that is the intracellular domain. The newly added limitations '81-109' and 
'232-253' imply that '81-109' and '232-253' are the extracellular domains." Id at 26-27. 

As explained in more detail below, paragraph [0196] of the specification states that "other 
methionine residues located either upstream or downstream from the amino acid position 1 in the 
figures may be employed as the starting amino acid residue for the PRO polypeptides." 
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Specification at T| [0196] (emphasis added). Figure 10 and SEQ ID NO: 10 show 8 methionine 
residues in the sequence, one of which begins with the methionine at residue #34. Combining 
the statement in paragraph [0196] with the disclosure of SEQ ID NO: 10, Appellants have 
conveyed with reasonable clarity to those skilled in the art that, as of the filing date sought, 
Appellants were in possession of polypeptides of SEQ ID NO: 10 beginning at any of the 
methionine residues listed in SEQ ID NO: 10. Therefore, Appellants were clearly in possession 
of "the polypeptide having the amino acid sequence of amino acids 34-321 of SEQ ID NO: 10" 
at the time of filing, and the related amendments do not constitute new matter. 

With respect to the amendment "the polypeptide having the amino acid sequence of 
amino acids 81-109 or 232-253 of SEQ ID NO: 10," Figure 10 discloses the following fragments 
of SEQ ID NO:10: amino acids 57-80, 110-126, 215-231, and 254-274. Implicit in the 
disclosure of fragments 57-80 and 1 10-126 of SEQ ID NO: 10 is the disclosure of the fragment 
81-109 of SEQ ID NO:10. Likewise, the disclosure of fragments 110-126 and 215-231 of SEQ 
ID NO:10 implicitly discloses the fragment 232-253 of SEQ ID NO:10. Therefore, in light of 
Figure 10, Appellants were clearly in possession of the subject matter "the polypeptide having 
the amino acid sequence of amino acids 81-109 or 232-253 of SEQ ID NO: 10" at the time of 
filing, and the amendments do not constitute new matter. 

!• The Examiner Arsuments Fail to Overcome the Presumption that the 
Claimed Invention is Adequately Described in the Specification 

Paragraph [0196] of the specification states that "it is conceivable and possible that other 
methionine residues located either upstream or downstream from the amino acid position 1 in the 
fifiures may be employed as the starting amino acid residue for the PRO polypeptides ." 
Specification at ^ [0196] (emphasis added). Figure 10 and SEQ ID NO: 10 show 8 methionine 
residues in the sequence. Combining the statement in paragraph [0196] with the disclosure of 
SEQ ID NO: 10, Appellants have conveyed with reasonable clarity to those skilled in the art that, 
as of the filing date sought, Appellants were in possession of polypeptides of SEQ ID NO: 10 
beginning at any of the methionine residues listed in SEQ ID NO: 10. One of the these 
polypeptides is the one which begins with the methionine at residue #34. Therefore, Appellants 
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were clearly in possession of "the polypeptide having the amino acid sequence of amino acids 
34-321 ofSEQIDNO: 10" at the time of filing. 

Appellants have also described in Figure 9, SEQ ID N0:9 and the cDNA deposited under 
ATCC accession number 209922, a cDNA sequence which encodes the entirety of the 
polypeptide of SEQ ID NO: 10, As such, SEQ ID N0:9 inherently discloses the coding sequence 
of the polypeptides of SEQ ID NO: 10 which start at any of the eight methionine residues, 
including the polypeptide which begins at methionine #34. One of skill in the art would clearly 
recognize that methionine #34 is encoded by the codon beginning at nucleotide 100 of SEQ ID 
N0:9, and that the stop codon ends at nucleotide 966. Therefore, the polypeptide of SEQ ID 
NO: 10 which begins at residue #34 is encoded by nucleotides 100-966 of SEQ ID N0:9 and 
cDNA deposited under ATCC accession number 209922. Appellants were therefore clearly in 
possession of "the amino acid sequence of the polypeptide encoded by nucleotides 100-966 of 
the cDNA deposited under ATCC accession number 209922" at the time of filing. 

The Examiner acknowledges that paragraph [0196] discloses that methionine residues 
upstream or downstream of the amino acid in position 1 may be the start amino acid. However, 
the Examiner appears to argue that because there is more than one methionine residue in SEQ ID 
NO: 10, Appellants have not adequately described any of the possible polypeptides of SEQ ID 
NO: 10 beginning at a methionine residue. The Examiner states that "the species methionine 
residue #34 as the starting amino acid is not supported by this generic disclosure because there is 
no express, implicit, or inherent support for this species to the exclusion of all the other species . 
In other words, there is no evidence that the disclosure would reasonably lead the skilled artisan 
to this particular species ." Office Action at 25 (emphasis added.) 

Appellants submit that this argument misstates the test for compliance with the written 
description requirement. The test is "whether the specification conveys with reasonable clarity to 
those skilled in the art that, as of the filing date sought, applicant was in possession of the 
invention as now claimed ." M.P.E.P, §2163.02 (internal citations omitted, emphasis added). 
Clearly, as discussed above, at the time of filing Appellants were in possession of the polypeptide 
of SEQ ID NO: 10 starting at methionine #34 and the nucleic acid sequence which encodes this 
polypeptide. Contrary to the Examiner's assertion, where Appellants have adequately described 
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several polypeptides related to SEQ ID NO: 10, there is nothing in the written description 
requirement which prevents the Appellants from claiming only one of them. 

Therefore, Appellants submit that the Examiner has failed to meet its initial burden of 
"presenting by a preponderance of evidence why a person skilled in the art would not recognize 
in an applicant's disclosure a description of the invention defined by the claims" with regard to 
this subject matter. M.P.E.P. §2163.04 (internal citations omitted, emphasis added). 

Finally, Figure 10 discloses the following fragments of SEQ ID NO: 10: amino acids 57- 
80, 110-126, 215-231, and 254-274. Implicit in the disclosure of fragments 57-80 and 110-126 
of SEQ ID NO:10 is the disclosure of the fragment 81-109 of SEQ ID NO:10. Likewise, the 
disclosure of fragments 1 10-126 and 215-231 of SEQ ID NO: 10 implicitly discloses the fragment 
232-253 of SEQ ID NO: 10. Therefore, in light of Figure 10, Appellants were clearly in 
possession of the subject matter "the polypeptide having the amino acid sequence of amino acids 
81-109 or 232-253 of SEQ ID NO: 10" at the time of filing. 

The Examiner argues that while Figure 10 discloses that SEQ ID NO: 10 possesses several 
transmembrane domains, the extracellular domains depend on how the polypeptide is arranged in 
the membrane, and that the amendments imply that "81-109" and "232-253" are the extracellular 
domains. The Examiner concludes that "[s]upport for the one arrangement implied by the 
present limitations cannot be found in the disclosure as originally filed. Hence, the newly added 
limitations constitute new matter." Office Action at 26-27. 

Appellants submit that this rejection is improper because it is a rejection not of the 
subject matter that is claimed, but rather what the Examiner believes is "implied" by the claims. 
The disputed portion of the claims reads "An isolated polypeptide comprising:... (b) the amino 
acid sequence of the polypeptide having the amino acid sequence of amino acids 81-109 or 232- 
253 of SEQ ID NO: 10." Nothing in this claim suggests or implies that the claimed portions of 
SEQ ID NO: 10 are intracellular, extracellular or transmembrane portions of the protein. 
Appellants remind the PTO that the test for written description on which a new matter rejection 
is based is "whether the specification conveys with reasonable clarity to those skilled in the art 
that, as of the filing date sought, applicant was in possession of the invention as now claimed ." 
M.P.KP, §2163.02 (internal citations omitted, emphasis added). Thus, the Examiner is 
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improperly reading limitations into the claim that are not present, and his rejection is therefore 
improper since it is not directed to the claimed subject matter, but to what the Examiner feels is 
"implied" in the claims. In the absence of any other arguments as to why one of skill in the art 
would not recognize a description of the claimed invention in Appellants' disclosure, the 
Examiner has failed to rebut the presumption that the specification satisfies the written 
description requirement with respect to this subject matter. See M.P.E.P. § 2163.04. 

2. Appellants' Response and Groupins of Claims - The Amendments to Rejected 
Claims 6-7. 9 and 11-1 7 Do Not Constitute New Matter 

The adequacy of written description support underlying a new matter rejection is a factual 
issue and is to be determined on a case-bv-case basis . See e.g., Vas-Cath, Inc. v. Mahurkar, 935 
F.2d at 1563, 19 U.S.P.Q.2d at 1116 (Fed. Cir. 1991) (emphasis added). The factual 
determination in a written description analysis depends on the nature of the invention and the 
amount of knowledge imparted to those skilled in the art by the disclosure. Union Oil v. Atlantic 
Richfield Co., 208 F.3d 989, 996 (Fed, Cir. 2000). 

As discussed above, the Examiner must address each of the pending claims on a case-by- 
case basis. This is because the subject matter of the limitations to the claims differ for each 
group of claims, and therefore whether or not the specification supports the claims depends on 
the limitation at issue. Appellants hereby request that the Board consider the following 
groupings of Claims 6-7, 9 and 11-17 with respect to the written description requirement. 

a. The amendments to rejected Claims 6 and 12-17 are adeauatelv 
described 

For purposes of the new matter rejection under 35 U.S.C. § 1 12, first paragraph. Claims 6 
and 12-17 can be considered as a group. As explained above, paragraph [0196] of the 
specification discloses that other methionine residues can be the start of the PRO polypeptide, 
and Figure 10 and SEQ ID NO: 10 show a methionine at residue #34, Combining the statement 
in paragraph [0196] with the disclosure of SEQ ID NO: 10, Appellants have conveyed with 
reasonable clarity to those skilled in the art that, as of the filing date sought, Appellants were in 
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possession of polypeptides of SEQ ID NO: 10 beginning at any of the methionine residues listed 
in SEQ ID NO: 10, including the one at position 34. Therefore, Appellants were clearly in 
possession of "the polypeptide having the amino acid sequence of amino acids 34-321 of SEQ ID 
NO: 10" at the time of filing, and the related limitations do not constitute new matter. 

Similarly, as explained above, one of skill in the art would recognize that the polypeptide 
of SEQ ID NO: 10 which begins at residue #34 is encoded by nucleotides 100-966 of SEQ ID 
N0:9 and cDNA deposited under ATCC accession number 209922. Appellants were therefore 
clearly in possession of "the amino acid sequence of the polypeptide encoded by nucleotides 100- 
966 of the cDNA deposited under ATCC accession number 209922" at the time of filing. 

With respect to the limitation "the polypeptide having the amino acid sequence of amino 
acids 81-109 or 232-253 of SEQ ID NO: 10," Figure 10 implicitly discloses fragments 81-109 
and 232-253 of SEQ ID NO: 10. Therefore, in light of Figure 10, Appellants were clearly in 
possession of the subject matter "the polypeptide having the amino acid sequence of amino acids 
81-109 or 232-253 of SEQ ID NO: 10" at the time of filing, and the amendments do not 
constitute new matter. 

b. The amendment to rejected Claim 7 is adequately described 

The amendment to Claim 7 recites the limitation "the amino acid sequence of amino 

acids 34-321 of SEQ ID NO: 10," and is adequately described for the reasons articulated above. 
Because Claim 7 recites only one of the three objected to limitations, it should be considered 
independently from the other claims. 

c. The amendment to rejected Claim 9 is adeauatelv described 

The amendment to Claim 9 recites the limitation "the amino acid sequence of amino 
acids 81-109 or 232-253 of SEQ ID NO: 10," and is adequately described for the reasons 
articulated above. Because Claim 9 recites only one of the three objected to limitations, it should 
be considered independently from the other claims. 
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d. The amendment to rejected Claim 11 is adeauately described 
The amendment to Claim 1 1 recites the limitation "the polypeptide encoded by 
nucleotides 100-966 of the cDNA deposited under ATCC accession number 209922," and is 
adequately described for the reasons articulated above. Because Claim 1 1 recites only one of the 
three objected to limitations, it should be considered independently from the other claims. 
3, New Matter - Conclusion 

In conclusion, the Board should reverse the Examiner's new matter rejection of Claims 6- 
7, 9 and 11-17 because the Examiner has failed to rebut the presumption that the amendments to 
the claims are adequately described, as he has failed to provide any evidence or reasoning to 
support the rejection: "[a] description as filed is presumed to be adequate , unless or until 
sufficient evidence or reasoning to the contrary has been presented by the examiner to rebut the 
presumption." M,P,E,P, § 2163.04 (emphasis added). 

And even if the Examiner's arguments are sufficient to rebut the presumption, Appellants 
submit that they have satisfied the written description requirement for the limitations to Claims 
6-7, 9 and 11-17. Appellants submit that the instant disclosure "conveys with reasonable clarity 
to those skilled in the art that, as of the filing date sought, applicant was in possession of the 
invention as now claimed." Hence, Appellants respectfully request that the Board reverse the 
Examiner's new matter rejection of Claims 6-7, 9 and 11-17 under 35 U.S.C. §112, first 
paragraph. 

F. Conclusion 

In view of the arguments presented above. Appellants submit that the specification as 
filed provides a specific, substantial and credible utility for the claimed polypeptides, and that the 
claimed subject matter is enabled and adequately described by the specification. Appellants 
therefore respectfully request that the Board reverse the rejections of the pending claims as 
lacking utility under 35 U.S.C. §101, and as not being enabled or adequately described under 35 
U.S.C. §112. 
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VIIL APPENDIX A ■ Claims on Appeal 
1-5. (Canceled). 

6. An isolated polypeptide comprising: 

(a) the amino acid sequence of the polypeptide having the amino acid 
sequence of amino acids 34-321 of SEQ ID NO: 10; 

(b) the amino acid sequence of the polypeptide having the amino acid 
sequence of amino acids 81-109 or 232-253 of SEQ ID NO: 10; or 

(c) the amino acid sequence of the polypeptide encoded by nucleotides 
100-966 of the cDNA deposited under ATCC accession number 
209922. 

7. The isolated polypeptide of Claim 6 comprising the amino acid sequence of the 
polypeptide having the amino acid sequence of amino acids 34-321 of SEQ ID NO: 10. 

8. (Canceled). 

9. The isolated polypeptide of Claim 6 comprising the amino acid sequence of the 
polypeptide having the amino acid sequence of amino acids 81-109 or 232-253 of SEQ ID NO: 
10. 

10. (Canceled). 

1 1 . The isolated polypeptide of Claim 6 comprising the amino acid sequence of the 
polypeptide encoded by nucleotides 100-966 of the cDNA deposited under ATCC accession 
number 209922. 

12. A chimeric polypeptide comprising a polypeptide according to Claim 6 fused to a 
heterologous polypeptide. 

13. The chimeric polypeptide of Claim 12, wherein said heterologous polypeptide is a 
tag polypeptide or an Fc region of an immunoglobulin. 

14. An isolated polypeptide having at least 95% amino acid sequence identity to: 
(a) the amino acid sequence of the polypeptide having the amino acid 

sequence of amino acids 34-321 of SEQ ID NO: 10; 
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(b) the amino acid sequence of the polypeptide having the amino acid 
sequence of amino acids 81-109 or 232-253 of SEQ ID NO: 10; or 

(c) the amino acid sequence of the polypeptide encoded by nucleotides 100-966 of 
the cDNA deposited under ATCC accession number 209922; 

wherein said isolated polypeptide or a fragment thereof can be used to generate an 
antibody which can be used to specifically detect the polypeptide of SEQ ID NO: 10 in 
lung tissue samples. 

15. The isolated polypeptide of Claim 14 having at least 99% amino acid sequence 
identity to: 

(a) the amino acid sequence of the polypeptide having the amino acid 
sequence of amino acids 34-321 of SEQ ID NO: 10; 

(b) the amino acid sequence of the polypeptide having the amino acid 
sequence of amino acids 81-109 or 232-253 of SEQ ID NO: 10; or 

(c) the amino acid sequence of the polypeptide encoded by nucleotides 100-966 of 
the cDNA deposited under ATCC accession number 209922; 

wherein said isolated polypeptide or a fragment thereof can be used to generate an 
antibody which can be used to specifically detect the polypeptide of SEQ ID NO: 10 in 
lung tissue samples. 

16. A chimeric polypeptide comprising a polypeptide according to Claim 14 fused to 
a heterologous polypeptide. 

17. The chimeric polypeptide of Claim 16, wherein said heterologous polypeptide is a 
tag polypeptide or an Fc region of an immunoglobulin. 
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IX. APPENDIX B - Evidence 



Attached hereto is a copy of the evidence cited in Appellants' Brief. The list of evidence 
below is accompanied by a statement setting forth where in the record that evidence was entered 



into the record by the Examiner. 






Tab 


Reference 


Submitted 


Entered 


1 


Hu et al. (J. Proteome 
Res. 2003; 2(4):405-12) 




Cited by Examiner in 
Office Action dated 
10/4/05 


2 


LaBaer (Nature 
Biotechnol. 2003; 
21(9):976-7) 




Cited by Examiner in 
Office Action dated 
10/4/05 


3 


Haynes et al 
(Electrophoresis 1998; 
19(11):1862-71) 




Cited by Examiner in 
Office Action dated 
10/4/05 


4 


Gygi et al (Mol. and 
Cell. Bio. 1999; 
19(3):1720-30) 




Cited by Examiner in 
Office Action dated 

10/4/05 


5 


Yousef et al (Cancer 
Res. 2003; 63(9):2223-7) 




Cited by Examiner in 
Office Action dated 
10/4/05 


6 


First Declaration of J. 
Christopher Grimaldi 


Originally Submitted on 12/7/04 
with Appellants' Amendment and 
Response to Office Action as 
Exhibit 1 


Entered by Examiner in 
Office Action dated 
3/30/05 


7 


Second Declaration by J. 
Christopher Grimaldi 


Originally Submitted on 12/7/04 
with Appellants' Amendment and 
Response to Office Action as 
Exhibit 2 


Entered by Exfuniner in 
Office Action dated 
3/30/05 


8 


Declaration of Paul 
Polakis, Ph.D. 


Oripinallv Suhmitted on 17/7/04 
with Appellants' Amendment and 
Response to Office Action as 
Exhibit 3 


Fntpfpd hv Fxaminer in 
Office Action dated 
3/30/05 


9 


Bruce Alberts, et al , 
Molecular Biology of the 
Cell (3'^ ed. 1994) 


Originally Submitted on 6/29/05, 
with Appellants' Submission filed 
with Request for Continued 
Examination as Exhibit 1 


Entered by Examiner in 
Office Action dated 
10/4/05 


10 


Bruce Alberts, et al^ 
Molecular Biology of the 
Cell (4'*^ ed. 2002) 


Originally Submitted on 6/29/05, 
with Appellants' Submission filed 
with Request for Continued 
Examination as Exhibit 2 


Entered by Examiner in 
Office Action dated 
10/4/05 
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1 1 Genes VI, (Benjamin 

Lewin, Genes VI (1997) 



Originally Submitted on 6/29/05, 
with Appellants' Submission filed 
with Request for Continued 
Examination as Exhibit 3 



Entered by Examiner in 
Office Action dated 
10/4/05 



1 2 Zhigang et al. , World 
Journal of Surgical 
Oncology 2:13,2004 



Originally Submitted on 6/29/05, 
with Appellants' Submission filed 
with Request for Continued 
Examination as Exhibit 4 



Entered by Examiner in 
Office Action dated 
10/4/05 



1 3 Meric et al , Molecular 

Cancer Therapeutics, vol. 
1,971-979 (2002) 



Originally Submitted on 6/29/05, 
with Appellants' Submission filed 
with Request for Continued 
Examination as Exhibit 5 



Entered by Examiner in 
Office Action dated 
10/4/05 
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X. APPENDIX C - Related Proceedings 

None - There are no decisions rendered by a court or the Board in any related 
proceedings identified above. 
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Analysis of Genomic and Proteomic Data Using Advanced Literature 

Mining 

Yanhui Hu, Lisa M. Hines, Haifeng Weng. Dongmel Zuo. Miguel Rivera. 
Andrea Richardson, and Josliua LaBaer* 

Institute of P/oteomics. Harvard Medical Seltoof-DCMP, 240 longwood Avenue. Boston, Massacliusetts 02115 

Received March 13« 2002 

High-throughput technologies, such as proteomic screening and ON A mtero-arrays, produce vast 
amounts of data requiring comprehensive analytical methods to decipher the biologically relevant 
results. One approach would be to manually search the biomedical literature; however, this would be 
an arduous task. We developed an automated literature-mining tool, termed MedGene. which 
comprehensively summarizes and estimates the relative strengths of all human gene-disease 
relalionsNps In Medline. Using MedGene. we analyzed a novel micro-array expression dataset 
comparing breast carwer and normal breast tissue in the context of existing knowledge. We found no 
correlation between the strength of tfie literature association and the magnitude of the difference in 
expression level when considering changes as high as 5-fotd; however* a significant correlation was 
observed (r = 0.41; p = 0.05) among genes showing an expression difference of 10-fold or more. 
Interestingly, this only held true for estrogen receptor (ER) positive tumors, not ER negative. MedGene 
identified a set of relatively understudied, yet highly expressed genes in ER negative tumors worthy of 
further examination. 

Keywords: bioinrormatics • micro-array » text mining ♦ gene-disease association • breast cancer 



Introduction 

kiiXs current pace* Uic accumulaLion of biomedical lilcraiurc 
outpaces the ability of most researchers and clinicians to stay 
abreast of Uiclr own Iminedlate flclds. lot alone cover a broader 
range of topics. For exaniplo. to follow a single disease. e.g„ 
breast cancer, a rcscarclicr would have had to scan 130 diTfcrcnt 
Joumab and read Zl papei^ per day in 19S9.^ This problem Is 
accentuated wiUi high'Uiroughpul icclinologlcs such as DNA 
micro-arrays and protcoinics. which require the analysts of 
lai]ge dalascts involving thousands of genes, many of which are 
unfamiliar to a particular researcher. In any microarray experi- 
ment, thousands of genes may demonstrate statlstlcaUy slg- 
niikant expression changes, but only a fraction of these may 
be relevant to the study. The ability to interpret these dalasets 
would be enhanced if they could be compared to a compre- 
hensive summary of wiiat is Icnown about all genes. Thus, there 
is a need to summarize cxisUng knowledge In a fomiat that 
allows for the rapid analysis oT associations between genes and 
diseases or otlicr specific biological concepts. 

One soluUon to this problem is lo coinpde slnicturcd digital 
resources, sudi as the Breast Cancer Gene Database' and the 
Tumor Gene Database.' f-fowcver. as these resources arc hand- 
curatcd. the labor-intensive review process becomes a ratc- 
liiniting step in growth of iIk* database. As a result, these 

' To wlxmt comaipontlence sliouUI tw mldnaied: JI»hM!»e|ims.luiivafdi.>du. 
iai«2Upr0340227 CCC; 12$M ® 2003 Amoican Chemicat Sodeiy 



databases have a limited scale and the genes arc not sdccled 
in a systematic fashion. 

An alternative approach is automated text mining; a method 
which involves automated information extraction by searching 
documents for text strings and analyang tlielr frequent and 
context This approach has been used successfully in several 
instances for biological applications. In most cases, it iias been 
applied to extract information about the relationships or 
interactions that proteins or genes luive with one another, in 
tlie literature or by funcUonal annotation.'"' Thus Tar. few 
publication have applied text*minlng to examine the global 
relationships between genes and diseases. Perez-Initxeta el al. 
automatically examined the CO (Gerte Ontology) annotation 
of genes and their predicted chromosomal locations In order 
lo identify genes linked to Inherited disorders.* 

To obtain a more global understanding of disease develop- 
ment, it would be valuable to Incorporate information regarding 
all possible gene-'disease relaUonships. including bioclicmical. 
physiological. phamiaoologScal. epidemiological, as well as 
genetic. This Infomiation would enable compreitensivc com- 
parisons between large experimental datasots and existing 
knowledge in the literature. This would accomplish two things. 
First. It would servo to validate experiments by demonstrating 
tiiat known responses occur as predicted. Second, it would 
rapidly highlight wliich genes arc corroborated by tiie literature 
and which genes are novel in a given context We liavc utilized 
a computational approach to literature mining to produce a 
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compiehensive scl of gene-dlseasc rdalionslilps. In addiUon, 
we have developed a novel approach to assess the strength ot 
each association based on the frequency of citation and co- 
dtation. We applied this tool to help interpret the data from a 
targe mlcro-array gene expression experiment comparing 
normal and cancerous breast tissue. 

Methods 

MedCene Database. MedCene is a relalional database, stor* 
ing disease and gene information from NCBI» text mining re- 
sults, statistical scores^ and liyperlinlcs to the primary lit- 
erature. McdGcne has a web-based user Interface for users to 
query the database (hap'y/hip5eqinedi)arvard.cdu/McdCenc/). 

Text Mining Algorithms. MeSH files were downloaded from 
the MeSH web site at NLM (NaUon library of Medicine) (http:// 
www.nlmjfiih.gov/mesh/meshhomc.htmI) and human disease 
categories were selected. LocusLlnk files were downloaded from 
the LocusLlnk web site at NCBI (htlp://www.ncbi. nih.gov/ 
LocusLlnSc/). OfUcial/prefcrred gene symbol, official/preferred 
gene name, and gene alternative symbols and names, all 
relevant armoLations and URLs for each LocusLlnk record, were 
collected. Gene search terms were used for literature searching 
and included all qualiflcd gene names, gene symbols, and gene 
family terms. Primary gene keys, predominantly qualidcd gene 
family terms and gene ofGcial/prefcrrcd symlwls. were used 
to Index Medline records. If the ofndal/preferred gene symt)ols 
did not meet Ihe standards to be an index, then qualiiled gene 
olTicial/prcferTed names were used. A local copy of Medline 
records (up to July, 2002) was prc-selectcd. 

A JAVA module examined the MeSH terms and il^cn indexed 
each Medline record with the appropriate discnse terms. A 
separate JAVA module was used to examine Uie titles and 
abstracts for gene search terms and then to index the gene- 
related Medline records with the relevant primaiy gene key($). 

Statistical Methods. For every gene and disease pair, we 
counted records thai were indexed for both gene and disease 
(double posiUvc hits), for disease only (disease single hits), for 
gene only (gene single hits), and for neither gene nor disease 
(double negathre hits) to generate a 2 x 2 contingency table. 
On the basis of the contingency tablc-framcwork. we applied 
different statistical methods to estimate the strcngUi of gene- 
disease relationships and evaluated the results. These methods 
Included chi-square analysis, Pisher*s exact probabilities, nsla- 
tivc risk of gene, and relaUve risk of disease" (http:// 
hipseq.medJiaryard.edu/McdCcne/). In addition, we computed 
the 'product of frequency*, which Is the product of Ihe 
proportion of disease/gene double hits to disease single hits 
and the proportion of disease/gene double hits to gene single 
hits. To obtain a normal distribution, wo transfonned all the 
statisUcal scores using the natural logaritlun. Wo selected tho 
log of the product of frequency (LPF) to validate MedCene and 
to use for the analysis with tlie micro-array data. Speannan 
rank-correlation coefficients were used to assess the linear 
retaUonship betvi'een LPF and micro-array fold change in 
expression level. 

Global Analysis. Diseases witl) at least 50 related genes were 
selected for clustering analysis, and die LPF scores were 
normalized wlUi total score for each disease. I licmrchlcal 
clustering was done with the •Cluster" software and the 
clustering result was visuaUzed using "TrceViewcr" (hUp:// 
rana.lbl.gov/EiscnSoFlware.htm). 
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Breast Tissue Micro-Arrays. Eighty-nine breast cancer 
samples (79% ER-posltlve) and 7 normal breast tissue samples 
were selected from the Harvard Breast SPORE Cnozen tissue 
repository and were representative of the spectnun of histo- 
logical types, grades, and hormone receptor immuno-pheno- 
typcs of breast cancer. RloOnylatcd cRNA. generated from the 
total RNA extracted from the bulk tumor, was hybridized Lo 
Affymelrix U95A ollgo -nucleotide micro-arrays. These mlao- 
arrays consist of 12 400 probes, which represent approximately 
9000 genes. Raw expression values were obtained using GENE- 
CHIP software from Affymctrix. and U^n further analyaed using 
the DNA-Chip Analyzer (dChlp) custom software. 

Results 

Automated Indexing of Medline Records by Disease and 
Gene. To study the gene-disease associations in the lilerature. 
we first compiled complete lists for human diseases and human 
genes. To Index all Medline records tlwt were relevant to 
human diseases, the Medical Subject Heading (MeSH) index 
of Medline records was udUzed. MeSH is a conUrolIcd medical 
vocabulary from the Nadonal Library of Medicine and consists 
of a set of terms or subject headings that are arranged In both 
an alphabetic and an hierarchical structure. Medline records 
are reviewed manually and MeSH terms are added to each with 
software assistance.'- *° Twenty-three human disease categoiy 
headings along with all of their child terms (see the Supporting 
Information. Supplemental Table 1, or visit http://iilpseq. 
mediiarvard.edu/MedGene/pubIication/sjrabie l.hlml) were 
selected from U^e 2002 MeSH Index creating a list of 4033 
liuinan diseases. 

No index comparable to the MeSH Index exists for genes, 
and thus. It was nccessaiy to apply a string search algorithm 
for gene names or symbols found in Medline texL A complete 
list or genes, gene names, gene symbols, and frequently used 
synonyms were collected from the LocusLinIc database at 
NCBI," " which contains 53 ZS9 independent records Iceyed 
by an oflkial gene symbol or name (June IS***. 2002). For the 
purposes of this study, no distinction was made between genes 
and their gene products. Authors often use tlic same name for 
both. dlflercntlaUng the two only by the use of italics. If at all. 
For the intended use of this study, this lack of distinction Is 
unlikely to have a large effect and may In fact be beneQdal. 

Initial attempts to search the literature using these lists 
revealed several sources oT false posiUves and false negath^es 
(table 1). False positives primarily arose when the searched 
term had other meanings, whereas false negatives arose from 
syntax discrepancies necessitating the development of filters 
to reduce these errors. The syntax issues were readily handled 
by including alternate syntax forms in the search tenris. The 
false positive cases, caused by duplicative and uruelalcd 
meanings for the terms, were more diflkult to manage. Where 
possible, case sensitive stririg mapping reduced Inappropriate 
dtaliofw. In many cases, however, {his was not sufllcient and 
tiie terms liad to be eliminated entirely, thereby reducing the 
false positive rale but uriavoidably undcr-mpresenting some 
genes. 

For die purposes of data tracking, a primary gene key was 
selected to represent all synonyms that correspond to each 
gene. Medline records were indexed with a primary gene key 
when any synonym for that key was found in the UUe or 
abstract Case-insensitive string mapping was used for all 
searches except as noted above. No additional weight was 
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Table 1. Systematic Sources of False PosiUves and False Negatives in Unllltered Data' 



source or cfTor 



enrof type 



example 



Otcf coUitlon 



gene symbol/name 
isnoi unique 



gene symbol is 

unrelated abbreviation 
gene symbol/name 

has language meaning 
nonstandard syntax 
unofficial gene name/symbol 
nonqxdflcd gene name 



false positive M^^^-myelln 

associated g^coprotein 
M/C-malignancy-assodatcd 
protein 

false pasilive Pi4-pallld homologuc (mouse). 

pallidin (ako abbrev. for Pennsylvania) 
false positive W45-Wlskott-Aldrich Syndnmie 

(also the word Vas") 
false negative BAG-i instead of BAG I 
false negative F53 instead of TP53 
false negative estrogen receptor instead of 

Estrogen receptor 1 



elinUnate this teim 

eliminate tlUs term 

case-sensitive string search 

add dash term 

add all gene nlcloiamcs 

add family stem term 



* In preliminary sludles. Medline was searched fot co-<>c£urrcncc of gicncsand diseases and the rcsutUng output was evaluated to Identify efror sources thai 
were axnenat>lo lo glc^l fillers. Each error source is categorized by the type of error it causes: fi&lso positives arc suggested relationships that are not real and 
false negsilves are real rdallonshJps that nrt undemepresented. The filter soIuUons used ace Indicated. Note that In some cases, the filter soluUon Itself Introduces 
error. In general, error rates maximized sensitivity, even at the expense of specificity If needed. 



added for multiple occurrences of a term or the Co-occurrence 
of mtiitJptc synonyms for the same gene key. 

Medline records were searched with all qualified gene 
Identifiers, such as the offlclal/prcferred gene symbol the 
ofHcjal/prefcrred gene name, all gene nicknames and all syntax 
variants. In situations where there are several members of a 
gene family or splice variants, some authors prefer Lo use a 
shortened gene family name. c.g., estrogen receptor instead of 
estrogen receptor 1 (ESRt), creating a source of false negatives. 
For this reason, gene family stem terms were created for all 
genes that have an alpha or numerical sullSx (eg.. JL2RA, TCFfi, 
ESRJ, etc.) and then used to search the literature. The family 
stem icfms were handled separately from the spedflc gene 
names so that It would be clear when linkages were made to 
the gene family versus a spcciQc member In that family. 

To improve peifonnance and accuracy, some jwe^selectlon 
was applied to the records that were scanned. First, review 
articles were eliminated to avoid redundant treatment of 
citations. Second, non-EngUsh journals were removed because 
the natural language fillers were only relevant to English 
publications. Finally, Jountab unlikely to contain primary data 
atraut gene-disease relationships were also removed <e.g., InL 
I Health Edue,, Bedstek Nurse, and / Heaith Eton), Together, 
these raurs reduced the 12 198 221 MedUne publlcalions Ouly 
2002) by 37%. 

Ranking the Relative Strengths of Gcnc-Disease Associa- 
tions. In total, there were 618 708 gene-disease co-citations, 
in which 1 e% (8297) of all studied genes had been associated 
to a disease and 96% (3875) of all diseases had been associated 
to at least one gene. To rank the relative strengths of gene 
disease relationships, we tested several different statistical 
methods and examined the results. With tlie exccptton of the 
relative risk estimates, the methods provided similar results 
with respect to the ronk order of the gcnc« disease association 
strengths. However, after comparing the results to oUier 
databases and after consulting disease experts, the tog of the 
product of frequency (LPF) was selected for further analysis 
because it gave the best results overall. 

Validation of MedGcnc. In developing this tool, it vras 
Iniportant to minimize the number of missed genes (false 
negatWes) and miscalled genes (false posiUvcs). However. In 
situattons when these goals were in conflfart. inclusiveness was 
prioritized. To determine tiie false negative rate in MedGcnc. 
breast cancer was used as a test case because It was associated 
with more genes tlian any otlicr human disease and because 




Figure 1. Estimation of the false negative rate by comparison 
with hand*curated datdbases. The breast cancer-related genes 
Identified by MedGene were compared with those listed In 
several other databases Including the Tumor Gene Database 
(TGD).z the Breast Cancer Gene Daiabase(BCG).* GeneCards 
(GC)^' and Swissprot." Genes were considered false negatives 
if they were represented In at least one of these other databases 
and not in MedGene and their link to breast cancer was sup* 
ported by at least one fiterature reference. All literature references 
were verified by manual review to confirm their validity. The 
number of genes In each database or shared by more than one 
database is Indicated. The false negative rate was calculated by 
genes missed at MedGene (26)/total number of nonoverlapping 
genes in other databases (285). 

tiiere were several publk databases that link genes to breast 
cancer. We compared the Ifet of breast cancer-related genes 
from MedGene to these databases, Illustrated in Figure 1. 
Among the 285 distinct breast canccr-rclatcd genes that were 
supported by at least one literaiure citation In these hand- 
curatcd databases. 26 were absent from McdCone. suggesting 
a false negative rate of approximately D%. To dctcmiiiie why 
these were missed, all Uierature references for these genes (80 



Journal of Pioleome Research • Vol. 2, No. 4, 2003 4D7 



research articles 



HuetaL 



papeis) were reviewed manually (see the Supporting Inibrma- 
Uoa Supplemental Table 2, or vbll htlp;//hlpseq.med. 
harvaid.edu/MedGene/pubUcation/sjrable SLhtrnQ. Among 
these papers, most false negatives were caused by nonstandard 
gene tenns or gene terms eliminated by our speclfidfy filters. 
Few genes were missed because they were only mentioned In 
review papers (0.4%) or they appeared only in the body of the 
manuscript but not the abstract or Utlc (1.1%). Of note, 
MedCene Identified approximately 2000 additional breast 
cancer-related genes ru>t listed in any other database. 

To assess the fiilse positive error rate, two complementary 
approaches were used: a detailed analysis of one disease and 
a global examination of 1000 diseases. The detailed approach 
examined Ihc false positive error rate and its sources, whereas 
the global approach tested whether the overall results made 
biomedical sense. 

Using the LPF, 1467 genes related to prostate cancer were 
assembled In rank order. We then retrieved approximately 300 
Medline records each for the highest ranked 100 and the lowest 
ranked 200 genes and manually reviewed the UUes ami 
abstracts to determine the verily of Uie association. Nearly 80% 
of the highest ranked 100 genes fell Into one of the five 
categories that reQect meaningful gcne-dlsease relationships 
(see the Supporting Information, Supplemental Table 3. or visit 
hltp://hlpscq.med.harvard.edu/MedGcne/pubUcation/ 
sJTable 3.html). Among the lowest ranked 200 genes, ap- 
proximately 70% reflected tnie relationships. Of the 600 records 
reviewed, ^l^&rc were only two in which tiie association between 
the gene and the disease was described as negative. Both were 
genes with very low scores. In both cases, the authors did not 
argue the absence of any relationship, but rather that a 
particular feature of the gene or protein was not shown to be 
related to human prostate cancer."-" 

The coincidence of some gene symbols with medkal ab- 
breviations, chemical abbreviations and biological abbrevia- 
tioris resulted In most of the fabe positives (see the Supporting 
InfonnaUon. SupplemenUl Table 4. or visit http://hlpse. 
q.medJiarvard.edu/MedG€ne/pubUcaUon/s_Table 4.hUnl). em- 
phasizing the importance of the Utters that were added in the 
search algorithm QTable 1). Without the filters, the false positive 
rale more than doubled, and the false negative rale rose 
diamallcally (data not shown). For example, among the papers 
about breast cancer, there were only 12 Medline records that 
rcfenred to ESRJ and 10 to ESR2, whereas almost 2000 papers 
mentioned estrogen receptor without sped^g £SI?J or ESR2i 
this latter group was detected by the famUy stem tenn filter. 

To fUnher validate tiiese results, a gbbal amlysls of the gene- 
disease relationships described by McdCcne was performed. 
For this experiment. It was reasoned that the more closely 
related the diseases are to one another, the mora they will be 
related to the same gene sets. Thus, if the relationships defined 
by MedCene accurately reflected the literature, tlien an unsu- 
pcrvised Mcrarcliical clusteririg of the gene data should group 
diseases in a manner consistent with common medical think- 
ing. Conversely, if the clustered diseases do not make sense 
biologically or medically. It may refXea excessive false positives, 
false negatives, or Inappropriate scoring of tlie data. 

To execute this experiment, the gene sets and the corre- 
sponding LPF values for 1000 randomly selected diseases (eacli 
with at least 50 gene relationships) were used as a datasct for 
clustering the diseases. A review of the results showed that the 
resulting disease dusters were Indeed logical based upon 
common medical knowledge {see the SuppofUng InfonnaUon. 
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Supplemental Figure I, or visit http^/hlpseq jnediuurvard.edu/ 
MedCene/pubUcatlon/s^igure IJitrnQ. For example. In one 
such duster shown in Figure 2. diabetes and its complications 
grouped together and were also dosely linked to diseases 
assodated with starvation states. 

The number of genes assodated wKh a ^en disease can 
be estimated by adjusting the MedCene number up by the false 
negathre rate {'-9%) and down by the false positive rale (-26% 
on average). Using this, the average disease has 103.7 dk 45.3 
(mean ± s.d.) genes assodated with it, although the range Is 
quite broad with 2359 genes related to breast cancer, 2122 
genes related to lung cancer and no genes related to a number 
of diseases. 

Applying MedGene to the Analy^ of Large Datascts. Access 
to a comprehensive summary of the genes linked to human 
diseases provided an opportunity to analyze data obtained from 
a high'throughput experiment We compared the MedGene 
breast cancer gene Ust to a gene expression data set generated 
from a micro-array analysis comparing breast cancer and 
normal breast tissue samples. Micro-array analysis identified 
2286 genes tiiat had greater than a l>fold dilTerence In mean 
expression level between breast cancer samples and normal 
breast samples. Using MedCene. we sorted the 2286 genes into 
four classes: 555 genes directly linked to breast cancer In (he 
literature by gene term search (first>degree association by gene 
name): 328 genes directly linked by family term search (first- 
degree assodation by family term); 1021 genes linked to breast 
cancer only through other breast cancer genes (second-degree 
association): and 505 genes not previously associated vtrlth 
breast cancer. (See the Supporting Information, Supplemental 
Figure 2. or visit http://hIpscq.med.harvard.edu/MedCene/ 
publlcation/sj^lgure 2.hunl.) Among the 505 previously un- 
related genes. 467 were either newly Identified genes or genes 
tliat had not previously been associated witli any disease. 
Among die remaining 38 genes. 9 had been related to other 
cancers, specifically esophageal coloa uterine, skla and cervix. 

To determine whether the genes highlighted by die micro- 
array analysis were more likely lo have been previously linked 
to breast cancer in the Ulerature, wo created a two-dimensional 
plot of the fold change of exprcsston level between breast 
cancer and normal tissue versus the literature score (LPF) 
(Figure 3A). There was a broad spread of expression changes 
amoi>g the genes directly linked to breast cancer ranging from 
less than i-fold change (68%) to over 40-fQld (03%). NotaUy. 
the irajority of genes wkh greater than 10-fcdd expression 
changes were linked to breast cancer by first-degree assoda- 
Uon. 

Among all 754 genes directly linked to breast cancer In the 
literature, there was no correlation between LPF and miao> 
array fold change (r =* 0.018, p-valuc = 0.62). However, when 
we stratified the analysis based on the magnitude of the fold 
change, we observed an Increasing trend in correlation (Figure 
3B) suggesting that genes with a more subsUntlal change in 
expression levd were more likely to have a su-onger assodation 
In die literature. For genes that had l0>fold change or more in 
expression level, Uie correlation Increased to 0.41 (o-value « 
0.05). 

When we evaluated the micro-array data separately for ER 
positive and ER negative tumors, the U-cnd in correlaijon 
between fold change and literature score was higlily dependent 
on estrogen receptor status. Interestingly, there was a similar 
U-cnd In corrdaUon for ER posIUve tumors^ but no trend in 
correlation for £R negative tumors. 



Analysis of Data Using Advanced Literature Mining 



research articles 



A 




B 



Obesity in Piatoatos 
Diabetic Kotoaeidoala 

Oliioode Xntoleranoe 

DlaDtttos M«Xlitus« NQQ-Xn0uXla-S9p«a4«nt 

Diabetes KAllitu8« insulin-Depandent 

Pregnancy in Diabetica 

Diabetic Retinopatby 

Diabetic Angiopathies 

Diabetic Neuropabhiea 

aiycosuzla 

Byperinsuliaism 

Hyperinsullnamla 

Hypogrlycenla 

Hyperglycemia 

Diabetes MeXlitue« Bxparinental 

Diabetes HeXIltus 

Diabetes* Gestational 

?taxvaCfon 

Jaundice, Kecnatal 

Brain Bdema 

FulJBonaxy Sdoona 

nutrition Dlsordera 

Rwaahiorkor 

Critical Xllnosa 

Burna 

Diabetic Nephropathies 

Albtuoinuria 

Insulinoma 




Figure Z Global validation by clustering analysis. 2(AK The gene sets and the corresponding LPF values for 1000 diseases, each with 
ai least 50 gene relationships, wera used in an unsupervised clustering of the diseases based on the gene patterns associated with 
them. A sample of the data Is shown here. 2(q>. One of the resulting clusters Is shown that corresponds to blood sugar states. Diabetes 
terms (at>ove the line) and starvation states terms (under the line) clustered together. Within these groups, there b also clustering of 
diabetic small vessel complications, altered serum chemistries, nutritional disorders. etc,(SuppTemen(at Figure 1: http;//h{pseq.med. 
han/ard.edu/MedGene^pubiicatlon/s^Ftgure Lhtrnl). 

FinaUy. to validate our Gndlngis. we computed simUar cor- disease unrelated to breast cancer. As expected, we did not 
relations between ihe breast cancer expression data and observe an increasing trend in correlation for hyperlcn* 
LPF scores generated by McdGene for hypeitenslon. a slon. 
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Microarray Fold Change (Cancer/Normal) 




Microarray Fold Change 



Figure 3. Relationship between literature score and functional data for breast cancer. 3A. The data from an expression analysis of 
samples for breast tumors and nDrmat breast tissue Mrere analyzed to Indicate Ihe fold difference of expression level between hreast 
tumor and normal sample (cMoff > 3-fold change). The fo!d changes were plotted agalnsi the literature score for the same gene set. 
Green dots represent first-degree association by gene search, blue dots represent first-degree association by family search and red 
dots represent no-associailon. Some welt-siudled genes, such as BRCA2 (pink circle), are not reflected by a subsiamlar difference In 
expression level. Furthermore. Ihe majority of genes that have no association with breast cancer In the literature had less than 10-fold 
expression changes (shaded area). SB. The Spearman rank-coaefailon coeff Icienis between literature score (LPF) and the fold change 
of expression level between tumor and normal breast samples O^axis) in relation to the amount of fold change of expression level 
(x-axIs). Gene rank lists were generated for breast cancer (blue) and hypertension (jpink). Correlations were also computed between 
the breast cancer gene IPF scores and fold change expression data among estrogen receptor p»lUve tumors only OlgM Wue) and 
estrogen receptor negative tumors only (purple). 
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brcAM neoplasnu 


hypertension 


rhcumaldd arthritis 


bipolar disorder 


flOitfosdetosIs 


estrogen receptor 


REN 


RA 


ERDAt 


apolipoprotcln 


PCR 


DBP 


TNFRSFIOA 


SNAP29 




ERDB2 


lEP 


CRP 


PFKL 


IDIR 


BRCAl 


ACT 


AS 


DR02 


ELN 


BRCA2 


INS 


ESRt 


TRH 


ARGl 


EGFR 


kallikrcin 


HLA-DRBi 


1MPA2 


APOB 


cms 


ACE 


DRi 


HTR3A 


APOAl 


TFFI 


cndothclin 


interlcukin 


DRD3 


MSRl 


PSEN2 


SiOOAS 


TNF 


REM 


LPl 


TP53 


BDK 


116 


KCNN3 


PONI 


CES3 ' 








plasmirx)gen 


DIANPH 


collagen 


DRD4 


activator inhibitor 


CEACAM5 


SARi 


ILtA 


HTR2C 


PLC 

vascular cell 


ERBB3 


PIH 


ACR 


RELN 


adhesion molecule 


cydin 


CDS9 


TNFRSFiZ 


DBH 


Atom 


caxsA 


ALB 


ILB 


MAOA 


VWF 


cathepsin 


CYFliBZ 


CHI3U 


com 


INS 


ERDB4 


MAT2B 
angiotensin 


IL8 


HTR2A 


ARC2 


TRAM 


receptor 


inicrleukin 1 
matrix 


SYNji 


ABCAi 


CCNDI 


Acm 


mecalloprotelnase 


INPPl 


out! 


EOF 


NPPA 


inlerferon 


NEDD4L 


collagen 


MUCi 


LVM 


CD66 


FRAI3C 
transducer of 


MCP 


insulin- like 


DBH 


IL4 


ERBB2 


lipoprotein 


Ba2 


NPY 


iU7 


BAIAP3 


AP0A2 
Intercellular 


mucin 


POMC 


MMP3 


ATPIB3 


adhesion molecule 


FGF3 


neuropeptide 


SH 


DRDS 


RAB27A 



' McdGcne results for the top 2S genes iusodatcd with breast neophsm faypciteiision. rheumatoid aithiUs* bipolar disorder, and alheiosclerasis^ respccUvely; 
ranked by LPP scores. The hyperilfik to all the papers co-citing the gene and the disease is available at MedGene wefastic fhttp://hlpscq.inedharvardedu/ 
McdCene/). ^ 



Discussion 

The Hunian Genome Project heralded a new era In biological 
research where the emphasis on understanding specific path- 
ways has expanded (o global studies of genomic organization 
and biological systems. High- throughput technologies can 
provide novel insight inlo comprehensive biological AiTicUon 
but also introduces new challenges. The utility of these 
leduiologlcs is limited to the abUily to generate, analyze, and 
interpret large gene lists. MedGcne, a relational database 
derived by mining the Information in Medline, was created to 
address this need. McdCene users can query for a rank-ordcnsd 
list of human gene-dlscase relationships fTable 2) for one or 
more diseases. Each entry Is hyperiinked to tiie original papeis 
supporting each association and to oUier relevant databases. 

MedGenc is an irmovatlvc cxlcnsion of previous text mining 
approaches. Perez-IraUcU el al. used (he CO annotation and 
their chromosomal locations (o predict genes that may con- 
tribute to inherited disorders." MedCene takes a broader view 
and includes alt diseases and all possible gene-diseue ndallon* 
ships. Furthermore. MedCene uUUzes co-dution to indicate a 
rdaUonship rather than CO annotaUoa which is limited to the 
subset of genes Uiat iiavc CO annotation. Our approach is 
complementary to that taken by Chaussabel and Sher. who 
used Uie frequcn^ of co-clted terms to cluster genes inlo a 
liicrarchy of gene-gene relationships.* 

A unique aspect of this tool Is the ability to assess the relative 
strengths of gene-disease relationships based on ihe frequency 
of both co-citaUon and single citation. This presupposes that 
most co-ciiations describe a poslQve association, often refcrrcd 
to as publlcaUon bias** and is supported by our observations 



that negative associaUons are rare Supplemental Table 3: 
htip://hipseqjncdJiarvard.edu/MedGcne/publIcation/s.Ta- 
ble 3.html). Of course. relaUonshlps established by frequency 
of co-dlation do not necessarily represent a true blologjcal link: 
however, it is strong evidence to support a (rue relationship. 

Another important feature of MedCene is the Impleincnta- 
tlon of software flltefs that substantially reduced the error rate. 
We estimate that less than 10% of all associations were missed 
and at least 70% of even the weakest associations were real. 
For this study, all of the Alters that we applied were general 
ones. e.g.. expanding tlie list of aU gene names to address the 
different syntax fcums used by dUferent Joumals. eUmb^ating 
gene names that correspond to common English words, etc. 
The majority of the remaining search term ambiguities were 
idiosyncratic and dilBcutt to Identify systemaUcaUy without 
causing a significant rise in false negaUvcs. AitcmaUve ap- 
proaches; such as tlie examination of the nearest neighbor 
terms, need to be considered to further reduce the false positive 
rate. 

It Is not uncommon to see expression changes in micro- 
array experiments assmall as 2-fold reported in the literature. 
Even when these e}q)ress!on changes are statistical^ significant. 
It Is not always dear if they are biologically meaningful. When 
comparing expression levels of disease to normal tissue, one 
expects an enrichment of known disease-related genes to 
appear in the altered expression group. MedCene provided a 
unique opportunity to test this notion in the context of existing 
knowledge on a novel breast cancer micro-array datasct. For 
genes displaying a 5-fold change or less in tumors compared 
to normal, there was no evidence of a correlation between 
altered gene expression and a known role in the disease. This 
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TaUo S. Genes with Large Expression Changes in ER- btrt 
Not In ER+ Breast Tumors 



gene symbol 


fold ctiange (ER+) 




KKTHBl 


1.0 


610.8 


BRS3 


1.2 


89.4 


DKIU 


1.2 


69.8 


ZiCi 


1.9 


59.6 


TlMt 


1.0 


38.5 


KIAA0680 


2.6 


33.2 


CDKN3 


l.O 


30.6 


EB12 


4.0 


27.9 


GZMB 


3.B 


21.9 


STK18 


4.7 


18.6 


CPR49 


1,0 • 


14.6 


MYOlO 


1.6 


14.4 


LADI 


-1.0 


13.5 


P0LE2 


4.2 


13.0 


HMC4 


4.4 


12.9 


BCUlil 


-\2 


12.3 


IRP8 


Z3 


12.2 


CCNB2 


t.O 


11.8 


CCNE2 


4.0 


11.6 


FOB 


-4J 


U.l 


KNSL6 


2.9 


ia9 


HtFS 


3X) 


ia2 


SERPINH2 


4.6 


10.2 


YAPl 


1.0 


10.0 


LPHB 


-1.3 


-10.4 


TCEA2 


-1.1 


-10.8 


TFFI 


1.3 


-11.4 


COU7A1 


-4.1 


-15,7 


POPS 


l.t 


-16.2 


BPACi 


-4.6 


-223 


FDZKi 


-I.I 


-36.8 


VECFC 


-2.8 


-51.5 


MUC6 


-1.4 


-64.9 


SERPINA5 


-1.0 


-83.1 


MBSi 


-1.6 


-85.9 


CA12 


2.4 


-1503 



T«U»)e 3. McdGcne IdctHiflcd a set of relsitivd> iindcrsludted* yet highly 
expressed genes In ER ncgaltve. bul noi £R posilive breast luraocs. All of 
Uiesc genes have cttbcr never been co-dted with breast cancer or have a 
weak assodation except those marked with an *. 

reflects the many genes whose role In breast cancer may not 
involve laiigc changes In expression in sporadic tumors (e.g., 
BRCAl and BRCA^ and genes whose modest changes In 
expression may be unrelated to the disease. Sdildngly, among 
genes with a 10-foid change or more In expression level, there 
was a strong and signiGcant conelaUon tietwcen expression 
level and a published role in the disease, providing the first 
global validation of the micro-array approach to identifying 
diseaso-spedfic genes. 

The results derived from MedCene have two iinpUcailons. 
First, a careTul hunt for corroboratlr^ evidence of a role in 
breast cancer slwuld precede any further study of genes with 
less than S-fold expression level changes. Second, any gimcs 
with 10-fohl changes or more are likely to be related to breast 
cancer and wanrant attenUoa It is likely that this threshold will 
change depending on tlie disease as well as tlie experiment 

Interestingly, the observed coirdalion was only found among 
lER-positivc tumors, not ER-negative. This may reOea a bias 
in the literature to study the more prevalent type of tumor in 
the population. Furthermore, this emphasizes that caution 
must be taken when interpreting exfieriments that may contain 
subpopulalions that behave very dinbrcntly. The McdCcne 
approach identified a set of relatively understudied, yet highly 
expressed genes In ER-ne^tive tumors that are wortliy of 
further examination (Table 3). 
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In conclusion, we have developed an automated method of 
summarizing and organizing the vast txiomedlcal literature. To 
our knowledge, the resulting database is the most comprehen- 
sive and accurate of its kind. By generating a score that reflects 
the strength of the assodadon. it provides an Important tool 
for the rapid and flexible anaSy^ of large datasets from various 
hJgh-thrcHighput screening experiments. Furthermore, It can 
be used for selecting subsets of genes for functional studies, 
for building disease-speclflc arrays, for looking at genes com- 
mon (o multiple diseases and various other high-throughput 
applications. In the future, it wiU be possible to enhance the 
utility of the MedGene database by building links between 
genes and other MeSH terms as well as other blologlcai 
processes and concepts, such as cell division and responses to 
small molecules. 
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is that risk relates to the environment and 
human health. On the other hand, recent 
studies have repeatedly shown that public 
hesitance also includes a number of ethical 
issues (e.g., market dominance of a few large 
companies and GM crops threatening 
natural or divine orders> refs 
1 >2). Our worry is that the 
US government is 
neglecting widespread 
concerns of the European 
pubHc that include more 
than environmental risk 
and human health. 

Research carried out by 
our group in Denmark* 
indicates that, although 
many people are confident 
that the public authorities 
are able to manage the risks 
here and now, people are less confident about 
their ability to handle long-term effects 
because of the scientific uncertainty. 
Attempts to conceal these or other limits to 
scientific knowledge do not prevent 
controversies from arising; rather, the 
opposite happens because trust in business, 
scientific experts and public authorities is 
undermined (witness the handling of the 
BSE controversy in the United Kingdom). 

In the long run» a policy of openness about 
the different dimensions of uncertainty 
would be more likely to increase trust in 
scientific risk assessment. Of course, this 
will not guarantee public acceptance of GM 
food, but experience in Europe shows that 
transparency and dialog are prerequisites for 
decreasing concerns about new technology. 




The argument that the EU s resistance 
to GM food has had negative consequences 
for developing countries, denying them 
access to a technology that could alleviate 
food provision, is regarded sympathetically 
by many among the European public. 

Indeed, here most people 
abandon the simple 
dichotomy between 
'unacceptable' GM food and 
the much more acceptable 
medical uses. This is because 
GM foods 

are seen as a means to help 
people in distress. Many 
counter such humanitarian 
uses, however, by the 
observation that, in general, 
GM crops are developed not 
to benefit people in the 
developing world, but to make money. 
Needless to say, according to those who 
point this out, making money is not in 
itself an acceptable objective. Thus, the fear 
is that the benefits will never accrue to 
those who are at present suffering. 

Kristian Borch, Jesper Lassen 
& Rikke B Jorgensen 

Centre for Bioethics and Risk Assessment, 
Systems Analysis Department, 
PO Box 49, DK'4000 Roskilde, Denmark 
e-mail: kristian.horch@risoe,dk 
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263-271 (2002). 

2. Wagner. W. et al. in Biotechnology 1996-2000. The 
Years ofCorttn>versy{Gaske\, G. & Baver, M.W., eds.) 
80-95 (Science Museum, London, 2001) 



Mining the literature and large 
datasets 



(or perhaps was exacerbated by) a UK 
government seen to be welcoming of GM 
foods and crops. Another negative was that 
it was major transnational corporations — 
another questionable community in the 
eyes of much of the public here — that 
were seeking to push their new products 
onto tiie public without previous debate 
and without there being any perceptible 
benefit. And finally, the potentially negative 
impact of GM crops on organic farmers — 
who are seen by some as crucially 
important for the sustainable future of 
food production — and the relatively small 
scale of agricultural production in the 
United Kingdom (and Europe) have also 
been important issues. 

The question to be answered, therefore, 
is not how to force the EU to accept GM 
foods and crops against its own public 
opinion, but how to change public opinion 
in the EU. The UK government is currently 
conducting several exercises that it hopes 
will provide the facts to support a relaxation 
of the moratorium on growing GM crops. 
These include a major review of the costs 
and benefits of GM crops (just finished), 
a scientific review of the issues (also now 
finished), a series of crop trials (results in 
September) and a public debate on GM 
crops, *GM nation' (just finished). 

Whether these will change attitudes is 
moot: the costs-and-benefits review has 
concluded that the economic value of the 
few currently available GM crops that could 
be grown in the UK is likely to be limited 
because of negative consumer attitudes to 
GM foods. 

MH Barnes 

3 The Spinney, 
Watford WD174QF, UK 
e-maib MBames466@aolcom 

To the editor: 

Several articles in the July and August issues 
of Nature Biotechnology (21, 735-738, 2003; 
21, 852-854, 2003) discuss whether the US 
strategy of forcing the European Union 
(EU; Brussels, Belgium) to accept GM foods 
by referring to World Trade Organisation 
(WTO; Geneva, Switzerland) rules will bear 
fi-uit. We do not believe so— rather the 
opposite. 

A central claim in the arguments of 
both President Bush and US commerce 
representative Robert B. Zoellick is that the 
risk of GM foods is negligible. The veracity 
of that statement, however, depends on what 
is defined as risk. A common understanding 



To the editor: 

In the accelerating quest for disease 
biomarkers, the use of high-throughput 
technologies, such as DNA microarrays 
and proteomics experiments, has produced 
vast datasets identifying thousands of 
genes whose expression patterns differ in 
diseased versus normal samples. Although 
many of these differences may reach 
statistical significance, they are not always 
biologically meaningful. For example, 
reports of mRNA or protein changes of 
as little as two-fold are not uncommon, 
and although some changes of this 
magnitude turn out to be important, most 



are attributable to disease-independent 
differences between the samples. Evidence 
gleaned from other studies linking genes to 
the disease is helpful, but with such large 
datasets, a manual literature review is often 
not practical. Thus, the power of these 
emerging technologies — the ability to 
quickly generate large sets of data — has 
challenged current means of evaluating 
and validating these data. One study from 
1999, for example, reveals that a researcher 
would have to scan 130 different journals 
and read 27 papers per day to follow a 
single disease, such as breast cancer'. 
To address this need, my group at 
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Harvard recently developed a freely 
accessible automated literature- 
mining tool) termed MedGene, that 
comprehensively summarizes the 
relationships among over 50,000 named 
human genes (and their synonyms) and 
over 4,000 human diseases from over 
12 million records in Medline 
(!utp://hipseq.med.harvard.edu/MedGene), 
Several key features of this resource are 
worth noting. First, MedGene is not 
limited to any specific relationship type, 
but rather encompasses all reported 
gene-disease links, including the 
genetic, biochemical, pharmacological, 
epidemiological and physiological. Second, 
the database assigns a mathematical score 
summarizing the strength of the 
association between the disease and the 
gene, which allows semiquantitative 
analysis and organizes the genes in rank 
order. Finally, the relationships are 
identified automatically by advanced text 
searching and filtering algorithms that 
result in low 

rates of false-positive and false-negative 
linkages^. In one query, MedGene 
identified nearly 2,400 breast 



cancer-related genes, whereas the same 
search in four commonly used databases 
yielded a combined total of 286 genes, 
260 of which were included in the 
MedGene list'-^ 

A summary of all gene-disease 
relationships offers the unique opportunity 
to both evaluate and validate the outcome 
of high-throughput experiments. For 
example, we used MedGene to analyze a 
DNA microarray experiment in which over 
2,000 genes demonstrated statistically 
significant differences in expression 
between normal breast tissue and breast 
cancer. It was able to identify the subset of 
these genes previously described as breast 
cancer-related genes in the literature. To 
determine whether gene expression level 
correlated with the strength of the 
association between gene and breast 
cancer, we plotted gene expression levels 
against the breast cancer literature 
relationship scores assigned by MedGene. 
Interestingly, 

there is no correlation when considering 
expression differences as high as fivefold; 
however, a significant correlation is 
observed (r = 0.41; P = 0.05) among genes 



showing a difference of tenfold or more. 
Thus, for this experiment, expression level 
differences as high as fivefold cannot be 
attributed to the disease without 
corroborating evidence. It will be 
interesting to learn if similar results hold 
for other diseases and other experiments. 

As the search for disease biomarkers and 
drug targets comes to rely increasingly 
upon genomic-scale technologies, demand 
will grow for automated resources, such as 
MedGene, that help process the resulting 
data volume. 

]o$\\uaLaBacr 

Institute of Proteomics, 
Harvard hdedical School 
250 Lotigwood Ave., BCMP, 
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1 Introduction 

A proteome has been defined as the protein complement 
expressed by the genome of an organism, or, in multicel- 
lular organisms, as the protein complement expressed by a 
tissue or difTerenliated cell IM. In the most common im- 
plementation of proteome analysis the proteins extracted 
from the cell or tissue analyzed are separated by high 
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Bioiechnoloey, University of Washington, Box 357730, Scatile. WA, 
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Abbreviations: CID, collision-induced dissociation; MS/MS, tandem 
mass spectrometry; SAGE, serial analysis of gene expression 

Keywords: Proicome / Two-dimensional potyacrylamide gel electro- 
phoresis / Tandem mass spectrometry 
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Proteome analysis: Biological assay or data archive? 

In this review we examine the current state of proteome analysis. Tliere are 
three main issues discussed: why it is necessary to study proteomes; how pro- 
teomes can be analyzed with current technology; and how proteome analysis 
can be used to enhance biological research. We conclude that proteome anal- 
ysis is an essential tool in the understanding of regulated biological sys ems. 
Current technology, while still mostly limited to the more abundant pro terns, 
enables the use of proteome analysis both to establish databases of proteins 
oresent, and to perform biological assays involving measurement of multiple 
variables. We believe that the utility of proteome analysis in future biologica 
research will continue to be enhanced by further improvements m analytical 
technology. 

resolution two-dimensional gel electrophoresis (2-DE), 
detected in the gel and identified by their amino acid 
sequence. The ease, sensitivity and speed with which gel- 
separated proteins can be identified by the use of recently 
developed mass speclrometric techniques have dramati- 
cally increased the interest in proteome technology. One 
of the most attractive features of such analyses is that conri- 
plex biological systems can potentially be studied in their 
entirety, rather than as a multitude of individual compo- 
nents. This makes it far easier to uncover the many com- 
plex and often obscure, relationships between mature 
gene products in cells. Large-scale proteome characteriza- 
tion projects have been undertaken for a number of dif- 
ferent organisms and cell types. Microbial proteome pro- 
jects currently in progress include, for example: Saccharo- 
myces cerevisiae [2], Salmonella enterica [3], Spiropiasma 
melliferum [41, Mycobacterium tuberculosis [5], Ochrobac- 
trum anthropi [61, Haemophilus influenzae [7], Synecho- 
cystis spp, [81, Escherichia coli [9], Rhizobium legumino- 
sarum [10], and Dictyostelium discoideum [111. Proteome 
projects underway for tissues of more complex organ- 
isms include those for: human bladder squamous cell 
carcinomas [121, human liver [13], human plasma [13], 
human keratinocytes [121, human fibroblasts 1121. mouse 
kidney [121, and rat serum [14]. In this manuscript we cri- 
tically assess the concept of proteome analysis and the 
technical feasibility of establishing complete proteome 
maps, and discuss ways in which proteome analysis and 
biological research intersect. 



2 Rationale for proteome analysis 

The dramatic growth in both the number of genome 
projects and the speed with which genome sequences 
are being determined has generated huge amounts ol 
sequence information, for some species even complete 
genomic sequences ([15-171). The description of the 
state of a biological system by the quantitative measure- 
ment of system components has long been a primary 
objective in molecular biology. With recent technical 
advances including the development of differential dis- 
play-PCR [18], cDNA microarray and DNA chip techno- 
logy [19, 20] and serial analysis of gene expression 
(SAGE) [21, 22], it is now feasible to establish global and 
quantitative mRNA expression maps of cells and tissues, 
in which the sequence of all the genes is known, at a 
speed and sensitivity which is not matched by current 
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protein analysis technology. Given the long-standing 
paradigm in biology that DNA synthesizes RNA which 
synthesizes protein, and the ability to rapidly establish 
global, quantitative mRNA expression maps, the ques- 
tions which arise are why technically complex proteome 
projects should be undertaken and what specific types of 
information could be expected from proteome projects 
which cannot be obtained from genomic and transcript 
profiling projects. We see three main reasons for pro- 
teome analysis to become an essential component in the 
comprehensive analysis of biological systems, (i) Protein 
expression levels are not predictable from the mRNA 
expression levels, (ii) proteins are dynamically modified 
and processed in ways which are not necessarily 
apparent from the gene sequence, and (iii) proteomes 
are dynamic and reflect the state of a biological system. 

2.1 Correlation between mRNA and protein expression 

levels 

Interpretations of quantitative mRNA expression profiles 
frequently implicitly or explicitly assume that for specific 
genes the transcript levels are indicative of the levels of 
protein expression. As part of an ongoing study in our 
laboratory, we have determined the correlation of expres- 
sion at the mRNA and protein levels for a population of 
selected genes in the yeast Saccharomyces cerevisiae 
growing at mid-log phase (S. P. Gygi et al., submitted for 
publication). mRNA expression levels were calculated 
from published SAGE frequency tables [22]. Protein 
expression levels were quantified by metabolic radiola- 
beling of the yeast proteins, liquid scintillation counting 
of the protein spots separated by high resolution 2-DE 
and mass spectrometric identification of the protein(s) 
migrating to each spot. The selected 80 samples consti- 
tute a relatively homogeneous group with respect to pre- 
dicted half-life and expression level of the protein pro- 
ducts. Thus far, we have found a general trend but no 
strong correlation between protein and transcript levels 
(Fig. I), For some genes studied equivalent mRNA trans- 
cript levels translated into protein abundances which 
varied by more than 50-fold. Similarly, equivalent steady- 
state protein expression levels were maintained by trans- 
cript levels varying by as much as 40-fold (S, P. Gygi 
et aL, submitted). These results suggests that even for a 
population of genes predicted to be relatively homoge- 
neous with respect to protein half-life and gene expres- 
sion, the protein levels cannot be accurately predicted 
from the level of the corresponding mRNA transcript. 

2.2 Proteins are dynamically modified and processed 

In the tnature, biologically active form many proteins are 
post-translalionaliy modified by glycosylation, phosphor- 
ylation, prenylation, acylation, ubiquitination or one or 
more of many other modifications [23] and many pro- 
teins are only functional if specifically associated or com- 
plexed with other molecules, including DNA, RNA, pro- 
teins and organic and inorganic cofactors. Frequently, 
modifications are dynamic and reversible and may alter 
the precise three-dimensional structure and the state of 
activity of a protein. Collectively, the state of modifica- 
tion of the proteins which constitute a biological system 
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Figure I. Correlation between mRNA and protein levels in yeast cells. 
For a selected population of 80 genes, protein levels were aieasured 
by ^'-S-radiolabeling and mRNA levels were calculated from publi- 
shed SAGE tables. Inset; expanded view of the low abundance region. 
For more experimental details, also see Figs. 5 and 6, (S. P. Gygi et a/., 
submitted). 



are important indicators for the state of the system. The 
type of protein modification and the sites modified at a 
specific cellular state can usually not be determined 
from the gene sequence alone. 

2.3 Proteomes are dynamic and reflect the state of a 
biological system 

A single genome can give rise to many qualitatively and 
quantitatively different proteomes. Specific stages of the 
cell cycle and states of differentiation, responses to 
growth and nutrient conditions, temperature and stress, 
and pathological conditions represent cellular states 
which are characterized by significantly different pro- 
teomes. The proteome, in principle, also reflects events 
that are under translational and post-translational con- 
trol. It is therefore expected that proteomics will be able 
to provide the most precise and detailed molecular des- 
cription of the state of a cell or tissue, provided tliat the 
external conditions defining the state are carefully deter- 
mined. In answer to the question of whether the study 
of proteomes is necessary for the analysis of biomolec- 
ular systems, it is evident that the analysis of mature pro- 
tein products in cells is essential as there are numerous 
levels of control of protein synthesis, degradation, 
processing and modification, which are only apparent by 
direct protein analysis. 



3 Description and assessment of current proteome 
analysis technology 

3.1 Technical requirements of proteome technology 

In biological systems the level of expression as well as 
the states of modification, processing and macro-molec- 
ular association of proteins are controlled and modu- 
lated depending on the state of the system. Comprehen- 
sive analysis of the identity, quantity and state of modifi- 
cation of proteins therefore requires the detection and 
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quantiiaiion of the proteins which constitute the system, 
and analysis of diflerentially processed forms. There are 
a number of inherent difficulties in protein analysis 
which complicate these tasks. First, proteins cannot be 
annplified. It is possible to produce large amounts of a 
particular protein by over-expression in specific cell sys- 
tems. However, since many proteins are dynamically 
post-translationally modified, they cannot be easily am- 
plified in the form in which they finally function in the 
biological system. It is frequently difficult to purify from 
the native source sufficient amounts of a protein for 
analysis. From a technological point of view this trans- 
lates into the need for high sensitivity analytical tech- 
niques. Second, many proteins are modified and pro- 
cessed post-translatipnally. Therefore, in addition to the 
protein identity, the structural basis for differentially 
modified isoforms also needs to be determined- The dis- 
tribution of a constant amount of protein over several 
differentially modified isoforms further reduces the 
amount of each species available for analysis. The com- 
plexity and dynamics of post-translational protein edit- 
ing thus significantly complicates proteome studies. 
Third, proteins vary dramatically with respect to their 
solubility in commonly used solvents. There are few, if 
any, solvent conditions in which all proteins are soluble 
and which are also compatible with protein analysis. This 
makes the development of protein purification methods 
particularly difficult since both protein purification and 
solubility have to be achieved under the same condi- 
tions. Detergents, in particular sodium dodecyl sulfate 
(SDS), are frequently added to aqueous solvents to 
maintain protein solubility. The compatibility with SDS 
is a big advantage of SDS polyacrylamide gel electro- 
phoresis (SDS-PAGE) over other protein separation 
techniques. Thus, SDS-PAGE and two-dimensional gel 
electrophoresis, which also uses SDS and other deter- 
gents, are the most general and preferred methods for 
the purification of small amounts of proteins, provided 
that activity does not necessarily need to be maintained. 
Lastly, the number of proteins in a given cell system is 
typically in the thousands. Any attempt to identify and 
categorize all of these must use methods which are as 
rapid as possible to allow completion of the project 
within a reasonable time frame. Therefore, a successful, 
general proteomics technology requires high sensitivity, 
high throughput, the ability to differentiate differentially 
modified proteins, and the ability to quantitatively dis- 
play and analyze all the proteins present in a sample. 

3,2 2-D electrophoresis - mass spectrometry: a common 
implementation of proteome analysis 

Tlie most common currently used implementation of 
proteome analysis technology is based on the separation 
of proteins by two-dimensional (lEF/SDS-PAGE) gel 
electrophoresis and their subsequent identification and 
analysis by mass spectrometry (MS) or tandem mass 
spectrometry (MS/MS). In 2-DE, proteins are first separ- 
ated by isoelectric focusing (lEF) and then by SDS- 
F*AGE, in the second, perpendicular dimension. Separ- 
ated proteins are visualized at high sensitivity by staining 
or autoradiography, producing two-dimensional arrays of 
proteins. l-DE gels are, at present, the most commonly 
used means of global display of proteins in complex 



samples. The separation of thousands of proteins has 
been achieved in a single gel [24, 25) and differentially 
modified proteins are frequently separated. Due to the 
compatibility of 2-DE with high concentrations of deter- 
gents, protein denaluranls and other additives promoting 
protein soliibility, the technique is widely used. 

The second step of this type of proteome analysis is the 
identification and analysis of separated proteins. Individ- 
ual proteins from polyacrylamide gels have traditionally 
been identified using ^terminal sequencing [26, 27], 
internal peptide sequencing [28, 29], immunoblotting or 
comigration with known proteins (30). The recent dra- 
matic growth of large-scale genomic and expressed 
sequence tag (EST) sequence databases has resulted in a 
fundamental change in the way proteins are identified by 
their amino acid sequence. Rather than by the traditional 
methods described above, protein sequences are now fre- 
quently determined by correlating mass spectral or 
tandem mass spectral data of peptides derived from pro- 
teins, with the information contained in sequence data- 
bases [31-33]. 

There are a number of alternative approaches to pro- 
teome analysis currently under development. There is 
considerable interest in developing a proteome analysis 
stragegy which bypasses 2-DE altogether, because it is 
considered a relatively slow and tedious process, and 
because of perceived difiTicuIties in extracting proteins 
from the gel matrix for analysis. However, 2-DE as a 
starting point for proteome analysis has many advan- 
tages compared to other techniques available today. The 
most significant strengths of the 2-DE-MS approach 
include the relatively uniform behavior of proteins in 
gels, the ability to quantify spots and the high resolution 
and simultaneous display of hundreds to thousands of 
proteins within a reasonable time frame. 

A schematic diagram of a typical procedure of the identi- 
fication of gel-separated proteins is shown in Fig. 2. Pro- 
tein spots delected in the gel are enzymatically or chemi- 
cally fragmented and the peptide fragments are isolated 
for analysis, as already indicated, most frequently by MS 
or MS/MS. There are numerous protocols for the gener- 
ation of peptide fragments from gel-separated proteins. 
They can be grouped into two categories, digestion in 
the gel slice (28, 34] or digestion after electrotransfer out 
of the gel onto a suitable membrane ([29, 35-37] and 
reviewed in [38]). In most instances either technique is 
applicable and yields good results. The analysis of MS or 
MS/MS data is an important step in the whole process 
because MS instruments can generate an enormous 
amount of information which cannot easily be managed 
manually. Recently, a number of groups have developed 
software systems dedicated to the use of peptide MS 
and MS/MS spectra for the identification of proteins. 
Proteins are identified by correlating the information 
contained in the MS spectra of protein digests or 
MS/MS spectra of individual peptides with data con- 
tained in DNA or protein sequence databases. 

The systems we are currently using in our laboratory are 
based on the separation of the peptides contained in pro- 
tein digests by narrow bore or capillary liquid chromatog- 
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haitre 2. Schematic diagram of a procedure for idenliftcation of gei- 
separated proteins. Peptides can either be separated by a technique 
such as LC or CE, or infused as a mixture and sorted in the MS. Data- 
base searching can either be performed on peptide masses from an 
MS spectrum, peptide fragment masses from CID spectra of peptides, 
or a combination of both. 



raphy (39, 40) or capillary electrophoresis (41 J, the anal- 
ysis of the separated peptides by electrospray ioniza- 
tion (ESI) MS/MS, and the correlation of the generated 
peptide spectra with sequence databases using the 
SEQUEST program developed at the University of Wash- 
ington (32, 33). The system automatically performs the 
following operations: a particular peptide ion character- 
ized by its mass-lo-charge ratio is selected in the MS out 
of all the peptide ions present in the system at a parti- 
cular time; the selected peptide ion is collided in a colli- 
sion cell with argon (collision-induced dissociation, 
CID) and the masses of the resulting fragment ions are 
determined in the second sector of the tandem MS; this 
experimentally determined CID spectrum is then corre- 
lated with the CID spectra predicted from all the pep- 
tides in a sequence database which have essentially the 
same mass as the peptide selected for CID; this correla- 
tion matches the isolated peptide with a sequence seg- 
ment in a database and thus identifies the protein from 
which the peptide was derived. There are a number of 
alternative programs which use peptide CID spectra for 
protein identification, but we use the SEQUEST system 
because it is currently the most highly automated pro- 
gram and has proven to be successful, versatile and 
robust. 



required. As an approximate guideline, for samples con- 
taining tens of picomoles of peptides, LC-MS/MS is 
most appropriate; for samples containing low picomole 
amounts to high femtomole amounts we use capillary 
LC-MS/MS; and for samples containing femtomoles or 
less, CE-MS/MS is the method of choice. 

3.3.1 LC-MS/MS 

The coupling of an MS to an HPLC system using a 
0.5 mm diameter or bigger reverse phase (RP) column 
has been described in detail [42J. This system has several 
advantages if a large number of samples are to be ana- 
lyzed and all are available in sufficient quantity. The 
LC-MS and database searching program can be run in a 
fully automated mode using an autosampler, thus maxi- 
mizing sample throughput and minimizing the need for 
operator interference. The relatively large column is 
tolerant of high levels of impurities from either gel prep- 
aration or sample matrix. Lastly, if configured with a 
flow-splitter and micro-sprayer [40|, analyses can be per- 
formed on a small fraction of the sample (less than 5%) 
while the remainder of the sample is recovered in very 
pure solvents. This latter feature is particularly useful 
when an orthogonal technique is also used to analyze 
peptide fractions, such as scintillation of an introduced 
radiotabel, and this data can be correlated with peptides 
identified by CID spectra. 

3.3.2 Capillary LC-MS 

An increase of sensitivity of approximately tenfold can be 
achieved by using a capillary LC system with a 100 nm ID 
column rather than a 0.5 mm ID column as referred to 
above. Since very low flow rates are required for such 
columns, most reports have used a precolumn flow split- 
ting system for producing solvent gradients. We have 
recently desribed the design and construction of a novel 
gradient mixing system which enables the formation 
of reproducible gradients at very low flow rates (low 
nL/min) without the need for flow splitting (A. Ducret 
el aL, submitted for publication). Using this capillary 
LC-MS/MS system we were able to identify gel-separat- 
ed proteins if low picomole to high femtomole amounts 
were loaded onto the gel (40J. This system is as yet not 
automated and, like all capillary LC systems, is prone to 
blockage of the columns by microparticulales when ana- 
lyzing gel-separated proteins. 



3.3 Ptolein identification by LC-MS/MS, capillary 
LC-MS/MS and CE-MS/MS 

It has been demonstrated repeatedly that MS has a very 
high intrinsic sensitivity. For the routine analysis of gel- 
separated proteins at high sensitivity, the most signif- 
icant challenge i.s the handling of small amounts of 
sample. The crux of the problem is the extraction and 
transferal of peptide mixtures generated by the digestion 
of low nanograin atnounts of protein, from gels into the 
MS/MS system without significant loss of sample or 
mtroduclion of unwanted contaminants. We employ 
ihree diflercnl systems for introducing gel-purified sam- 
ples into an MS, depending on the level of sensitivity 



3.3.3 CE-MS/MS 

The highest level of sensitivity for analyzing gel-sep- 
arated proteins can be achieved by using capillary elec- 
trophoresis - mass spectrometry (CE-MS). We have de- 
scribed in the past a solid-phase extraction capillary elec- 
trophoresis (SPE-CE) system which was used with triple 
quadrupole and ion trap ESI-MS/MS systems for the 
identification of proteins at the low femtomole to sub- 
femtomole sensitivity level (43, 44]. While this system is 
highly sensitive, its operation is labor-intensive and its 
operation has not been automated. In order to devise an 
analytical system with both the sensitivity of a CE and 
the level of automation of LC, we have constructed 
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Figure 3. Schemalic illustration of a 
microfabricated analytical system for CE, 
consisting of a micro machined device, 
coaled capillary clectroosmntic pump, 
and microelectrospray interface. The 
dimensions of the channels and reservoir 
are as indicated in the text. The channels 
on the device were graphically enhanced 
to make them more visible. Reproduced 
from (451, with permission. 



microfabricated devices for tiie introduction of samples 
into ESI-MS for high-sensitivity peptide analysis. 

The basic device is a piece of glass into which channels 
of 10-30 nm in depth and 50-70 \Lm in diameter are 
etched by using photolithography/etching techniques 
similar to the ones used in the semiconductor industry. 
(A simple device is shown in Fig. 3). The channels are 
connected to an external high voltage power supply [45]. 
Samples are manipulated on the device and off the 
device to the MS by applying dtfTerent potentials to the 
reservoirs. This creates a solvent flow by electroosmotic 
pumping which can be redirected by changing the posi- 
tion of the electrode. Therefore, without the need for 
valves or gates and without any external pumping, the 
flow can be redirected by simply switching the position 
of the electrodes on the device. The direction and rate of 
the How can be modulated by the size and the polarity 
of the electric field applied and also by the charge state 
of the surface. 

The type of data generated by the system is illustrated in 
Fig. 4, which shows the mass spectrum of a peptide sample 
representing the tryptic digest of carbonic anhydrase at 

290 fmol/^L. Each numbered peak indicates a peptide suc- 
cessfully identified as being derived from carbonic an- 



hydrase. Some of the unassigned signals may be chemical 
or peptide contaminants. The MS is programmed to auto- 
matically select each peak and subject the peptide to CID. 
The resulting CID spectra are then used to identify the 
protein by correlation with sequence databases. Therefore, 
this system allows us to concurrently apply a number of 
protein digests onto the device, to sequentially mobilize 
the samples, to automatically generate CID spectra of 
selected peptide ions and to search sequence databases 
for protein identification. These steps are performed auto- 
matically without the need for user input and proteins can 
be identified at very low femtomole level sensitivity at a 
rate of approximately one protein per 15 min. 

3.4 Assessment of 2-DE-MS proteome technology 

Using a combination of the analytical techniques de- 
scribed above we have identified the 80 protein spots 
indicated in Fig. 5. The protein pattern was generated by 
separating a total of 40 microgram of protein contained 
in a total cell lysate of the yeast strain YPH499 by high 
resolution 2-DE and silver staining of the separated pro- 
teins. To estimate how far this type of proteome analysis 
can penetrate towards the identification of low abun- 
dance proteins, we have calculated the codon bias of the 
genes encoding the respective proteins. Codon bias is a 




Figure 4. MS spectrum of a trypiic digest 
of carbonic anhydrase using the microfa- 
bricated system shown in Fig. 3. 290 
fmol/uL of carbonic anhydrase (ryplic 
digest was infused into a Finnigan LCQ 
ion trap MS. Each peak was selected for 
CID, and those which were identified as 
containing peptides derived from car- 
bonic anhydrase are numbered. Repro- 
duced from |45|, with permission. 
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Figure 5. 2-DE separation of a lysale of yeast cells, with Identified proteins highlighted. The first dimension of separation was an IPG from 
pH 3-10, and the second dimension was a 10%T SDS PAGE gel. Proteins were visualized by silver staining. Further details of experimental 
procedures are included in S. P. Gygi ei al. (submitted). 
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calculated measure of the degree of redundancy of trip- 
let DNA codons used to produce each amino acid in a 
particular gene sequence. It has been shown to be a 
useful indicator of the level of the protein product of a 
particular gene sequence present in a cell [46). The gen- 
eral rule which applies is that the higher the value of the 
codon bias calculated for a gene, the more abundant the 
protein product of that gene becomes. The calculated 
codon bias values corresponding to the proteins identi- 
fied in Fig. 5 are shown in Fig. 6b. Nearly all of the pro- 
teins identified (> 95%) have codon bias values of > 0.2, 
indicating they are highly abundant in cells. In contrast, 
codon bias values calculated for the entire yeasl genome 
(Fig. 6a) show that the majority of proteins present in 
the proteorne have a codon bias of < 0.2 and are thus of 
low abundance. 

This finding is of considerable importance in our assess- 
ment of the current status of proteorne analysis technol- 
ogy. It is clear that even using highly sensitive analytical 
techniques, we are only able to visualize and identify the 



more abundant proteins. Since many important regula- 
tory proteins are present only at low abundance, these 
would not be amenable to analysis using such tech- 
niques. This situation would be exacerbated in the anal- 
ysis of proteomes containing many more proteins than 
the approximately 6000 gene products present in yeast 
cells [16]. In the analysis of, for example, the proteome 
of any human cells, there are potentially 50000--IOG000 
gene products [47]. Inherent limitations on the amount 
of protein that can be loaded on 2-DE, and the number 
of components that can be resolved, indicate that only 
the most highly abundant fraction of the many gene 
products could be successfully analyzed. One approach 
thai has been employed to circumvent these limitations 
is the use of very narrow range immobilized pH gradient 
strips for the first-dimension separation of 2-DE [48). 
Since only those proteins which focus within the narrow 
range will enter the second dimension of separation, a 
much higher sample loading within the desired range is 
possible. This, in turn, can lead to the visualization and 
identification of less abundant proteins. 
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figure d. Calculated codon bias values Tor yeasi proteins. (A) Distribu- 
tion or calculated values for the entire yeast proteome. (B) Distribu- 
tion of calculated values for the subset of 80 identified proteins also 
shown in Figs. I and 5. Further details of experimental procedures are 
included in S. P. Gygi ei al. (submitted). 



4 Utility of proteome analysis for biological 
research 

For the success of proteomics as a mainstream approach 
to the analysis of biological systems it is essential to 
define how proteome analysis and biological research 
projects intersect. Without a clear plan for the implemen- 
tation of proteome-type approaches into biological re- 
search projects the full impact of the technology can not 
be realized. The literature indicates that proteome anal- 
ysis is used both as a database/data archive, and as a bio- 
logical assay or biological research tool. 

4 J The proteome as a database 

The use of proteomics as a database or data archive 
essentially entails an attempt to identify all the proteins 
in a cell or species and to annotate each protein with the 
known biological information that is relevant for each 
protein. The level of annotation can, of course, be exten- 
sive. The most common implementation of this idea is 
the separation of proteins by high resolution 2-DE, the 
identification of each detected protein spot and the 
annotation of the protein spots in a 2-DE gel database 
formal. Tliis approach is complicated by the fact that it is 
difficult to precisely define a proteome and to decide 
which proteome should be represented in the database. 
In contrast to the genome of a species, which is essen- 
tially static, the proteome is highly dynamic. Processes 
such as difTerentiation, cell activation and disease can all 
significantly change the proteome of a species. This is 
illustrated in Fig. 7. The figure shows two high-resolu- 



tion 2-DE maps of proteins isolated from rat serum. 
Fig. 7A is from the serum of normal rats, while Fig. 7B 
is from the serum of rats in acute-phase serum after 
prior treatment with an inflammation-causing agent (49). 
It is obvious that the protein patterns are significantly 
different in several areas, raising the question of exactly 
which proteome is being described. 

Therefore, a comprehensive proteome database of a spe- 
cies or cell type needs to contain all of the parameters 
which describe the state and the type of the cells from 
which the proteins were extracted as well as the software 
tools to search the database with queries which reflect 
the dynamics of biological systems. A comprehensive 
proteome database should be capable of quantitatively 
describing the fate of each protein if specific systems 
and pathways are activated in the cell. Specifically, the 
quantity, the degree of modification, the subcellular loca- 
tion and the nature of molecules specifically interacting 
with a protein as well as the rate of change of these 
variables should be described. Using these admittedly 
stringent criteria, there is currently no comlete proteome 
database. A number of such databases are, however, in 
the process of being constructed. The most advanced 
among them, in our opinion, are the yeast protein data- 
base YPD (501 (accessible at http://www.ypd.com) and 
the human 2D-PAGE databases of the Danish Centre 
for Human Genome Research (12) (accessible at http:// 
biobase.dk/cgi-bin/ceIis). While neither can be con- 
sidered complete as not all of the potential gene pro- 
ducts are identified, both contain extensive annotation 
of supplemental information for many of the spots 
which are positively identified in reference samples. 

4.2 The proteome as a biological assay 

The use of proteome analysis as a biological assay or 
research tool represents an alternative approach to inte- 
grating biology with proteomics. To investigate the state 
of a system, samples are subjected to a specific proceess 
that allows the quantitative or qualitative measurement 
of some of the variables which describe the system. In 
typical biochemical assays one variable (^.g., enzyme 
activity) of a single component (e.g., a particular en- 
zyme) is measured. Using proteomics as an assay, mul- 
tiple variables (e.g., expression level, rate of synthesis, 
phosphorylation state, etc.) are measured concurrently 
on many (ideally all) of the proteins in a sample. The 
use of proteomics as an assay is a less far-reaching prop- 
osition than the construction of a comprehensive pro- 
teome database. Ii does, however, represent a pragmatic 
approach which can be adapted to investigate specific 
systems and pathways, as long as the interpretation of 
the results takes into account that with current technol- 
ogy not all of the variables which describe the system 
can be observed (see Section 3.4). 

A common implementation of proteome analysis as a 
biological assay is when a 2-DE protein pattern gener- 
ated from the analysis of an experimental sample is 
compared to an array of reference patterns representing 
different states of the system under investigation. The 
state of the experimental system at the time the sample 
was generated is therefore determined by the quantita- 
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live comparative analysis of hundreds lo a few thousand 
proteins. Comparative analysis of the 2-DE patterns fur- 
ihermore highlights quantitative and qualitative differ- 
ences in the protein profiles which correlate with the 
state of the system. For this type of analysis it is not 
essential that all the proteins are identified or even visu- 



alized, although the results become more informative as 
more proteins are compared. It is obvious, however^ that 
the possibility lo identify any protein deemed character- 
istic for a particular state dramatically enhances this 
approach by opening up new avenues for experimenta- 
tion. 




Figure 7. High resolution 2-DE map of proteins isolated from rat serum with or without prior exposure lo an inflam- 
niation-causing agent. (A) normal rat serum, (B) acute-phase serum from rats which had previously been exposed to 
an inflammation-causing agent. Tiie first dimension of separation is an IPG from pH 4-10, and the second dimen- 
sion IS a 7.5-1 7. 5%T gradient SDS-PAGE gel. Proteins were visualized by staining with amido black. Further details 
of expcritiiental procedures are included in il4, 49|. 
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Proteome analysis as a biological assay has been success- 
fully used in the field of toxicology, to characterize 
disease states or to study differential activation of cells. 
The approach is limited, of course, by the fact that only 
the visible protein spots are included in the assay, and it 
is well known that a substantial but far from complete 
fraction of cellular proteins are detected if a total cell 
lysate is separated by 2'DE. Proteins may not be 
detected in 2-DE gels because they are not abundant 
enough to be visualized by the detection method used, 
because they do not migrate within the boundaries (size, 
p/) resolved by the gel, because they are not soluble 
under the conditions used, or for other reasons. 

A difTerent way to use proteome analysis as a biological 
assay to define the state of a biological system is to take 
advantage of the wealth of information contained in 
2-DE protein patterns. 2-DE is referred to as two-dimen- 
sional because of the electrophoretic mobility and the 
isoelectric points which define the position of each pro- 
tein in a 2-DE pattern. In addition to the two dimen- 
sions used to generate the protein patterns, a number of 
additional data dimensions are contained in the protein 
patterns. Some of these dimensions such as protein 
expression level, phosphorylation state, subcellular loca- 
tion, association with other proteins, rate of synthesis or 
degradation indicate the activity state of a protein or a 
biological system. Comparative analysis of 2-DE protein 
patterns representing different states is therefore ideally 
suited for the detection, identification and analysis of 
suitable markers. Once again it must be emphasized that 
in this type of experiment only a fraction of the cellular 
proteins is analyzed. Since many regulatory proteins are 
of low abundance, this limitation is a concern, particu- 
larly in cases in which regulatory pathways are being 
investigated. 

5 Concluding remarks 

In this report we have addressed three main issues 
related to proteome analysis. First, we have discussed 
the rationale for studying proteomes. Second, we have 
assessed the technical feasibility of analyzing proteomes 
and described current proteome technology, and third, 
we have analyzed the utility of proteome analysis for bio- 
logical research. It is apparent that proteome analysis is 
an essential tool in the analysis of biological systems. 
The multi-level control of protein synthesis and degrada- 
tion in cells means that only the direct analysis of 
mature protein products can reveal their correct identi- 
ties, their relevant state of modification and/or associa- 
tion and their amounts. Recently developed methods 
have enabled the identification of proteins at ever- 
increasing sensitivity levels and at a high level of auto- 
mation of the analytical processes. A number of tech- 
nical challenges, however, remain. While it is currently 
possible to identify essentially any protein spots that can 
be visualized by common staining methods, it is ap- 
parent that without prior enrichment only a relatively 
small and highly selected population of long-lived, 
highly expressed proteins is observed. There are many 
more proteins in a given cell which are not visualized by 
such methods. Frequently it is the low abundance pro- 
teins that execute key regulatory functions. 



We have outlined the two principal ways proteome anal- 
ysis is currently being used to intersect with biological 
research projects: the proteome as a database or data 
archive and proteome analysis as a biological assay. Both 
approaches have in common that at present they are con- 
ceptually and technically limited. Current proteome data- 
bases typically are limited to one cell type and one state 
of a cell and therefore do not account for the dynamics 
of biological systems. The use of proteome analysis as a 
biological assay can provide a wealth of information, but 
it is limited to the proteins detected and is therefore not 
truly proteome-wide. These limitations in proteomics are 
to a large extent a reflection of the fact that proteins in 
their fully processed form cannot easily be amplified and 
are therefore difficult to isolate in amounts sufficient for 
analysis or experimentation. The fact that to date no 
complete proteome has been described further attests to 
these difficulties. With continued rapid progress in pro- 
tein analysis technology, however, we anticipate that the 
goal of complete proteome analysis will eventually 
become attainable. 
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We have determined the relationship between mRNA and protein expression levels for selected genes 
expressed in the yeast Saccharomyces cerevisiae growing at mid-log phase. The proteins contained in total yeast 
cell lysate were separated by high-resolution two-dimensional (2D) gel electrophoresis. Over 150 protein spots 
were excised and identified by capillary liquid chromatography-tandem mass spectrometry (LC-MS/MS). 
Protein spots were quantified by metabolic labeling and scintillation counting. Corresponding mRNA levels 
were calculated from serial analysis of gene expression (SAGE) frequency tables (V. E. Velculescu, L. Zhang, 
W. Zhou, J. Vogelstein, M. A. Basrai, D. E. Bassett, Jr., P. Hieter, B. Vogelstein, and K. W. Kinzler, Cell 
88:243-251, 1997). We found that the correlation between mRNA and protein levels was insufficient to predict 
protein expression levels from quantitative mRNA data. Indeed, for some genes, ^ile the mRNA levels were 
of the same value the protein levels varied by more than 20-fold. Conversely, invariant steady-state levels of 
certain proteins were observed with respective mRNA transcript levels that varied by as much as 30-fold. 
Another interesting observation is that codon bias is not a predictor of either protein or mRNA levels. Our 
results dearly delineate the technical boundaries of current approaches for quantitative analysis of protein 
expression and reveal that simple deduction from mRNA transcript analysis is insufficient 



The description of the state of a biological system by the 
quantitative measurement of the system constituents is an es- 
sential but largely unexplored area of biology. With recent 
technical advances including the development of differential 
display-PCR (21), of cDNA microarray and DNA chip tech- 
nology (20, 27), and of serial analysis of gene expression 
(SAGE) (34, 35), it is now feasible to establish global and 
quantitative mRNA expression profiles of cells and tissues in 
species for which the sequence of all the genes is known. 
However, there is emerging evidence which suggests that 
mRNA expression patterns are necessary but are by them- 
selves insufficient for the quantitative description of biological 
systems. This evidence includes discoveries of posttranscrip- 
tional mechanisms controlling the protein translation rate (15), 
the half-lives of specific proteins or mRNAs (33), and the 
intracellular location and molecular association of the protein 
products of expressed genes (32). 

Proteome analysis, defined as the analysis of the protein 
complement expressed by a genome (26), has been suggested 
as an approach to the quantitative description of the state of a 
biological system by the quantitative analysis of protein expres- 
sion profiles (36). Proteome analysis is conceptually attractive 
because of its potential to determine properties of biological 
systems that are not apparent by DNA or mRNA sequence 
analysis alone. Such properties include the quantity of protein 
expression, the subcellular location, the state of modification, 
and the association with ligands, as well as the rate of change 
with time of such properties. In contrast to the genomes of a 
number of microorganisms (for a review, see reference 11) and 
the transcriptome of Saccharomyces cerevisiae (35), which have 
been entirely determined, no proteome map has been com- 
pleted to date. 

The most common implementation of proteome analysis is 
the combination of two-dimensional gel electrophoresis (2DE) 
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(isoelectric focusing-sodium dodecyl sulfate [SDS]-polyacryl- 
amide gel electrophoresis) for the separation and quantitation 
of proteins with analytical methods for their identification. 
2DE permits the separation, visualization, and quantitation of 
thousands of proteins reproducibly on a single gel (18, 24). By 
itself, 2DE is strictly a descriptive technique. The combination 
of 2DE with protein analytical techniques has added the pos- 
sibility of establishing the identities of separated protems (1, 2) 
and thus, m combination with quantitative mRNA analysis, of 
correlating quantitative protein and mRNA expression mea- 
surements of selected genes. 

The recent introduction of mass spectrometric protein anal- 
ysis techniques has dramatically enhanced the throughput and 
sensitivity of protein identification to a level which now permits 
the large-scale analysis of proteins separated by 2DE. The 
techniques have reached a level of sensitivity that permits the 
identification of essentially any protein that is detectable in the 
gels by conventional protein staining (9, 29). Current protein 
analytical technology is based on the mass spectrometric gen- 
eration of peptide fragment patterns that are idiotypic for the 
sequence of a protein. Protein identity is established by corre- 
lating such fragment patterns with sequence databases (10, 22, 
37). Sophisticated computer software (8) has automated the 
entire process such that proteins are routinely identified with 
no human interpretation of peptide fragment patterns. 

In this study, we have analyzed the mRNA and protein levels 
of a group of genes expressed in exponentially growing cells of 
the yeast S. cerevisiae. Protein expression levels were quantified 
by metabolic labeling of the yeast proteins to a steady state, 
followed by 2DE and hquid scintillation counting of the se- 
lected, separated protein species. Separated proteins were 
identified by in-gel tryptic digestion of spots with subsequent 
analysis by microspray liquid chromatography-tandem mass 
spectrometry (LC-MS/MS) and sequence database searching. 
The corresponding mRNA transcript levels were calculated 
from SAGE frequency tables (35). 

This study, for the first time, explores a quantitative com- 
parison of mRNA transcript and protein expression levels for 
a relatively large number of genes expressed in the same met- 
abolic state. The resultant correlation is insufficient for predic- 
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FIG. 1. Schematic iUustration of proteome analysis by 2DE and mass spectrometry. In part I, proteins are separated by 2DE, stained spots are excised and subjected 
to in-gel digestion with trypsin, and the resulting peptides are separated by on-line capillary high-performance liquid chromatography. In part II, a peptide is shown 
eiuting from the column in part I. The peptide is ionized by electrospray ionization and enters the mass spectrometer. The mass of the ionized peptide is detected, and 
the first quadrupole mass filter allows only the specific mass-to-charge ratio of the selected peptide ion to pass into the collision cell. In the collision cell, the energized, 
ionized peptides collide with neutral argon gas molecules. Fragmentation of the peptide is essentially random but occurs mainly at the peptide bonds, resulting in smaller 
peptides of differing lengths (masses). These peptide fragments arc detected as a tandem mass (MS/MS) spectrum in the third quadrupole mass filter where two ion 
series are recorded simultaneously, one each from sequencing inward from the N and C termini of the peptide, respectively. In part III, the MS/MS spectrum from the 
selected, ionized peptide is compared to predicted tandem mass spectra computer generated from a sequence database. Provided that the peptide sequence exists in 
the database, the peptide and, by association, the protein from which the peptide was derived can be identified. Unambiguous protein identification is attained in a single 
analysis because multiple peptides are identified as being derived from the same protein. 



tion of protein levels from mRNA transcript levels. We have 
also compared the relative amounts of protein and mRNA 
with the respective codon bias values for the corresponding 
genes. This comparison indicates that codon bias by itself is 
insufficient to accurately predict either the mRNA or the pro- 
tein ejq^ression levels of a gene. In addition, the results dem- 
onstrate that only highly expressed proteins are detectable by 
2DE separation of total cell lysates and that therefore the 
construction of complete proteome maps with current technol- 
ogy will be very challenging, irrespective of the type of organ- 
ism. 

MATERIALS AND MFTHODS 

Yeast strain and growth conditions. The source of protein and message tran- 
scripts for all experiments was YPH499 {MATa umS-Sl fys2-802 ade2-101 
leu2-^l his3'6200 trpl'^63) (30). Logarithmically growing cells were obtained by 
growing yeast ceils to early log phase (3 X lO'^ cells/ml) in YPD rich medium 
(YPD supplemented with 6 mM uracil, 4.8 mM adenine, and 24 mM tryptophan) 
at 30"C (35). Metabolic labeling of protein was accomplished in YPD medium 



exactly as described elsewhere (4) with the exception that 1 ml of cells was 
labeled with 3 mCi to offset methionine present in YPD medium. Protein was 
harvested as described by Garrels and coworkers (12). Harvested protein was 
lyophilized, resuspended in isoelectric focusing gel rehydration solution, and 
stored at -80*'C 

2DE. Soluble proteins were run in the first dimension by using a commercial 
flatbed electrophoresis system (Multiphor II; Pharmacia Biotech). Immobilized 
polyacrylamide gel (IPG) dry strips with nonlinear pH 3.0 to 10.0 gradients 
(Amersham-Pharmacia Biotech) were used for the first-dimension separation. 
Forty micrograms of protein from whole-cell lysates was mixed with IPG strip 
rehydration buffer (8 M urea, 2% Nonidet P-40, 10 mM dithiothreitol), and 250 
to 380 pi of solution was added to individual lanes of an IPG strip rehydration 
tray (Amersham-Pharmacia Biotech). The strips were allowed to rehydrate at 
room temperature for 1 h. The samples were run at 300 V-10 mA-5 W for 2 h, 
then ramped to 3,500 V-10 mA-5 W over a period of 3 h, and then kept at 3,500 
V-10 mA-5 W for 15 to 19 h. At the end of the first-dimension run (60 to 70 kV • 
h), the IPG strips were reequilibrated for 8 min in 2% (wt/vol) dithiothreitol in 
2% (wt/vol) SDS-6 M urea-30% (wt^ol) glycerol-0.05 M Tris HQ (pH 6.8) and 
for 4 min in 2JS% iodoacetamide in 2% (wt/vol) SDS-6 M urea-30% (wt/vol) 
glyceroM).05 M Tris HCI (pH 6.8). Following reequilibration, the strips were 
transferred and apposed to 10% polyacrylamide second-dimension gels. Poly- 
acrylamide gels were poured in a casting stand with 10% acryIamide-2.67% 
pipcrazine diacryiamide-0.375 M Tris base-HQ (pH 8.8)-0.1% (wt/vol) SDS-0.05% 
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FIG. 2. 2D sUver-stained gel of the proteins in yeast total cell lysate. Proteins were separated in the first dimension (horizontal) by isoelectric focusing and then in 
the second dimension (vertical) by molecular weight sieving. Protein spots (156) were chosen to include the entire range of molecular weights, isoelectric focusing points, 
and staining intensities. Spots were excised, and the corresponding protein was identified by mass spectrometry and database searching. The spots are labeled on the 
gel and correspond to the data presented in Table 1. Molecular weights are given in thousands. 



(wt/vol) ammonium persulfate-O.05% TEMED (/V//,/V',Ar'-tctramcthylcthyl- 
enediamine) in Milii-Q water. The apparatus used to run second-dimension gels 
was a nonconrnierdal apparatus from Oxford Glycosciences, Ina Once the IPG 
strips were apposed to the second-dimension gels, they were inunediately run at 
50 mA (constant)-500 V-85 W for 20 min, followed by 200 mA (constant)-500 
V-85 W until the buffer front line was 10 to 15 mm from the bottom of the gel. 
Gels were removed and silver stained according to the procedure of Shevchenko 
et al. (29). 

Protein identification. Gels were exposed to X-ray film overnight, and then the 
silver staining and film were used to excise 156 spots of varying intensities, 
molecular weights, and isoelectric focusing points. In order to increase the 
detection limit by mass spectrometry, spots were cut out and pooled from up to 
four identical cold, silver-stained gels. In-gel tryptic digests of pooled spots were 
performed as described previously (29). Tryptic peptides were analyzed by mi- 
crocapiUary LC-MS with automated switching to MS/MS mode for peptide 
fragmentation. Spectra were searched against the composite OWL protein se- 
quence database (version 30.2; 250^14 protein sequences) (24a) by using the 
computer program Sequest (8), which matches theoretical and acquired tandem 
mass spectra. A protein match was determined by comparing the number of 
peptides identified and their respecthre cross-correlation scores. All protein 
identifications were verified by comparison with theoretical molecular weights 
and isoelectric points. 



mRNA quantitation. Velculescu and coworkers have previously generated 
frequency tables for yeast mRNA transcripts from the same strain grown under 
the same stated conditions as described herein (35). The SAGE technology is 
based on two main principles. First, a short sequence tag (15 bp) that contains 
sufficient information uniquely to identify a transcript is generated. A single tag 
is usually generated from eadi mRNA transcript in the cell which corresponds to 
15 bp at the 3 '-most cutting site for Ma III. Second, many transcript tags can be 
concatenated into a single molecule and then sequenced, revealing the identity of 
multiple tags simultaneously. Over 20,000 transcripts were sequenced from yeast 
strain YPH499 growing at mid-log phase on glucose. Assuming the previously 
derived estimate of 15,000 mRNA molecules per cell (16), this would represent 
a 1.3-fold coverage even for mRNA molecules present at a single copy per cell 
and would provide a 72% probability of detecting such transcripts. Computer 
software which took for input the gene detected, examined the nucleotide se- 
quence, and performed the calculation as described by Velculescu and coworkers 
(35) was written. In practice, we found that for 21 of 128 (16%) genes examined 
viable mRNA levels from SAGE data could not be calculated. This was because 
(i) no CATG site was found m the open reading frame (ORF), (ii) a CATG site 
was found but the corresponding 10-bp putative SAGE tag was not found in the 
frequency tables, or (iii) identical putative SAGE tags were present for multiple 
genes (e.g., TDH2_YEAST and TDH3_YEAST). 
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TABLE 1. Expressed genes identified from 2D gel in Fig. 2 



Mol wt 


pl 


Spot no. 


YPD gene 
namc^ 


Protein 
abundance 
(lO' copies/ 
cell) 


mRNA 
abundance 
(copies/cell) 


Codon 
bias 


17,259 


6.75 


133 


CPRl 


15.2 


61.7 


0.769 


18,702 


4.80 


83 


EGD2 


20.1 


5.2 


0.724 


18,726 


4.44 


147 


YKL056C 


61.2 


88.4 


0.831 


18,978 


5.95 


135 


YER067W 


3.7 


6.7 


0.118 


19,108 


5.04 


130 


YLR109W 


94.4 


9.7 


0.680 


19,681 


9.08 


136 


ATP7 


11.0 


NA**^ 


0.246 


20,505 


6.07 


111 


GUKl 


16.5 


3.7 


0.422 


21,444 


5.25 


148 


SARI 


5.4 


10.4 


0.455 


21,583 


4.98 


95 


TSAl 


110.6 


40.1 


0.845 


22,602 


4.30 


80 


EFBl 


66.1 


23.8 


0.875 


23,079 


6.29 


112 


S0D2 


1Z6 


2.2 


0.351 


23,743 


5,44 


137 


HSP26 


NA' 


0.7 


0.434 


24,033 


5.97 


96 


ADKl 


17,4 


16.4 


0.656 


24,058 


4.43 


143 


YKL117W 


29.2 


10.4 


0.339 


24,353 


6.30 


140 


TFSl 


8.1 


0.7 


0.146 


24,662 


5.85 


99 


URA5 


25.4 


6.0 


0.359 


24,808 


6.33 


97 


GSPl 


26.3 


5.2 


0.735 


24,908 


8.73 


122 


RPS5 


18,6 


NA*^ 


0.899 


25,081 


4.65 


81 


MRP8 


9.3 


NA'^ 


0.241 


25,960 


6.06 


116 


RPEl 


5.8 


0.7 


0.372 


26,378 


9.55 


127 


RPS3 


96.8 


NA*^ 


0.863 


26,467 


5.18 


100 


VMA4 


10.5 


3.7 


0.427 


26,661 


5.84 


98 


TPIl 


NA** 


NA^ 


0.900 


27,156 


5.56 


93 


PRE8 


6.9 


0.7 


0.129 


27,334 


6.13 


115 


YHR049W 


18.4 


2.2 


0.520 


27,472 


5.33 


92 


YNLOlOW 


31.6 


3.7 


0,421 


27,480 


8.95 


123 


GPMl 


10.0 


169.4 


0.902 


27,480 


8.95 


124 


GPMl 


231.4 


169.4 


0.902 


27,480 


8.95 


125 


GPMl 


7.5 


169.4 


0.902 


27,809 


5.97 


139 


HOR2 


5.7 


0.7 


0.381 


27,874 


4.46 


78 


YSTl 


13.6 


52.8 


0.805 


28,595 


4.51 


41 


PUP2 


4.4 


0.7 


0.147 


29,156 


6.59 


114 


YMR226C 


14.5 


2,2 


0.283 


29^44 


8.40 


120 


DPMI 


5.0 


11.2 


0.362 


29,443 


5.91 


48 


PRE4 


3.4 


3.7 


0.162 


30,012 


6,39 


138 


PRBl 


21,2 


1.5 


0.449 


30,073 


4.63 


77 


BMHl 


14.7 


28.2 


0.454 


30,296 


7.94 


121 


OMP2 


67.4 


41.6 


0.499 


30,435 


634 


89 


GPPl 


70.2 


11.2 


0.703 


31,332 


5.57 


88 


ILV6 


13.9 


3.0 


0.402 


32,159 


5.46 


113 


IPPl 


63.1 


3.7 


0.752 


32,263 


6.00 


149 


HISl 


22.4 


4.5 


0.232 


33311 


5.35 


84 


SPE3 


15.1 


6.7 


0.468 


34,465 


5.60 


129 


ADEl 


8.7 


5.2 


0.305 


34,762 


5.32 


85 


SEC14 


10.9 


6.0 


0.373 


34,797 


5.85 


42 


URAl 


49.5 


8.9 


0.237 


34,799 


6.04 


90 


BBLl 


103.2 


81.0 


0.875 


35,556 


5.97 


43 


YDL124W 


6.4 


4.5 


0.206 


35,619 


8.41 


59 


TDHl 


69.8 


32.r 


0.940 


35,650 


5.49 


68 


CARl 


5.2 


3,0 


0.339 


35,712 


6.72 


117 


TDH2 


49.6 


473.0" 


0.982 


35,712 


6.72 


154 


TDH2 


863.5 


473.0" 


0.982 


35,712 


6.72 


155 


TDH2 


79.4 


473.0^ 


0.982 


36,272 


4,85 


128 


APAl 


8.7 


0.7 


0.425 


36,358 


5.05 


75 


YJR105W 


17.6 


17.1 


0.522 


36,358 


5,05 


76 


YJR105W 


27.5 


17.1 


0.522 


36,596 


6.37 


79 


ADH2 


58.9 


260.0^ 


0.711 


36,714 


6.30 


102 


ADHl 


746.1 


260.0 


0.913 


36,714 


6.30 


103 


ADHl 


17.6 


260.0 


0.913 


36,714 


6.30 


104 


ADHl 


61.4 


260.0 


0.913 


36,714 


6.30 


105 


ADHl 


52.7 


260.0 


0.913 


37,033 


6.23 


44 


TALI 


44.8 


3.7 


0.701 


37,796 


7.36 


57 


IDH2 


29.4 


6.7 


0.330 


37,886 


6.49 


106 


ILV5 


76.0 


4.5 


0.892 


38,700 


7.83 


55 


BATl 


30,9 


11.2 


0.469 


38,702 


6.24 


46 


QCR2 


NA'' 


2.2 


0.326 
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TABLE \— Continued 



Mol wt 




Spot no. 


YPD gene 
name" 


Protein 

abundance 
(lO' copies/ 
cell) 


mRNA 
abundance 
(copies/cell) 


Codon 
bias 


39,477 


5.58 


86 


FBAl 


17.8 


183.6 


0.935 


39,477 


5.58 


87 


FBAl 


427.2 


183.6 


0.935 


39,540 


6.50 


150 


H0M2 


60.3 


4.5 


0.592 


39,561 


6.12 


156 


PSAl 


96.4 


27.5 


0.718 


41,158 


6.01 


49 


YNL134C 


14.9 


1.5 


0.316 


41,623 


7.18 


58 


BAT2 


19.0 


8.9 


0.250 


41,728 


7.29 


110 


ERGIO 


24.1 


4.5 


0.543 


41,900 


5.42 


74 


TOM40 


22.3 


2.2 


0.375 


42,402 


6.29 


45 


CYS3 


6.7 


8.9 


0,621 


42,883 


5.63 


67 


DYSl 


15.8 


5.2 


0.526 


43,409 


6.31 


107 


SERl 


10.5 


1.5 


0.292 


43,421 


5.59 


91 


ERG6 


2.2 


14.1 


0.408 


44,174 


7.32 


56 


YBR025C 


13.1 


6.0 


0.684 


44,682 


4.99 


72 


TIFl 


2.9 


39.4 


0.834 


44,707 


7,77 


108 


PGKl 


23.7 


165.7 


0.897 


44,707 


7.77 


109 


PGKl 


315.2 


165.7 


0.897 


46,080 


6.72 


30 


CAR2 


15.4 


NA" 


0.495 


46,383 


8.52 


53 


IDPl 


7.7 


0.7 


0.436 


46,553 


5.98 


47 


IDP2 


32,4 


NA" 


0.197 


46,679 


6.39 


50 


ENOl 


35.4 


0.7 


0.930 


46,679 


6.39 


51 


ENOl 


6.6 


0.7 


0,930 


46,679 


6.39 


52 


ENOl 


2.2 


0.7 


0,930 


46,773 


5.82 


63 


EN02 


15,5 


289.1 


0.960 


46,773 


5.82 


64 


EN02 


635.5 


289.1 


0.960 


46,773 


5.82 


65 


EN02 


93.0 


289.1 


0.960 


46,773 


5.82 


66 


EN02 


31.0 


289.1 


0.960 


47,402 


6.09 


126 


CORl 


2.5 


0.7 


0.422 


47,666 


8.98 


54 


AAT2 


11.7 


6.0 


0.338 


48,364 


5.25 


73 


WTMl 


74.5 


13.4 


0.365 


48,530 


6.20 


61 


MET17 


38.1 


29.0 


0,576 


48,904 


5.18 


69 


LYS9 


16.2 


3.7 


0.463 


48,987 


4.90 


153 


SUP45 


29.6 


11.9 


0.377 


49,727 


5.47 


70 


PR02 


13.6 


5,2 


0.297 


49,912 


9.27 


62 


TEF2 


558.5 


282.0 


0.932 


50,444 


5.67 


35 


YDR190C 


4.8 


2.2 


0.228 


50,837 


6.11 


32 


YEL047C 


3.8 


1.5 


0.387 


50,891 


4.59 


151 


TUB2 


11.2 


7.4 


0.404 


51,547 


6.80 


27 


LPDl 


18.9 


2.2 


0,351 


52,216 


7.25 


29 


SHM2 


19.7 


7.4 


0.722 


52,859 


554 


37 


YFR044C 


30.2 


6.7 


0,442 


53,798 


5.19 


71 


HXK2 


26.5 


7.4 


0.756 


53.803 


6.05 


145 


GYP6 


4.4 


0.7 


0.147 


54,403 


5.29 


39 


ALD6 


37.7 


2.2 


0.664 


54,403 


5.29 


40 


ALD6 


6.6 


Z2 


0.664 


54,502 


6.20 


31 


ADE13 


6.3 


1.5 


0.417 


54,543 


7.75 


25 


PYKl 


225.3 


101.8 


0.965 


54,543 


7.75 


26 


PYKl 


39.8 


101.8 


0.965 


55,221 


6.66 


146 


YEL071W 


16.3 


3.0 


0.244 


55,295 


4.35 


134 


PDIl 


66,2 


14.1 


0.589 


55,364 


5.98 


24 


GLKl 


22.6 


6.0 


0.237 


55,481 


7.97 


118 


ATPl 


21.6 


2.2 


0.637 


55,886 


6.47 


28 


CYS4 


22.2 


NA*" 


0.444 


56,167 


5.83 


33 


AR08 


14.3 


3.0 


0.324 


56,167 


5.83 


34 


AR08 


9.1 


3.0 


0.324 


56,584 


6.36 


20 


CYB2 


18.9 


NA" 


0.259 


57,366 


5.53 


60 


FRS2 


2.3 


0.7 


0.451 


57,383 


5.98 


144 


ZWFl 


5.6 


0.7 


0.215 


57,464 


5.49 


36 


THR4 


21.4 


3.7 


0.508 


57,512 


5.50 


7 


SRV2 


6.5 


NA" 


0.260 


57,727 


4.92 


152 


VMA2 


33.7 


8.9 


0.546 


58,573 


6.47 


17 


ACHl 


4.4 


1.5 


0.327 


58,573 


6.47 


18 


ACHl 


5.4 


1.5 


0.327 


61,353 


5.87 


21 


PDCl 


6.5 


200.7 


0.962 


61,353 


5,87 


22 


PDCl 


303.2 


200.7 


0.962 


61,353 


5.87 


23 


PDCl 


16.3 


200.7 


0.962 


61,649 


5.54 


38 


CCT8 


2.2 


1.5 


0.271 
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Mol wt 




Spot no. 


YrU gene 
nsinc 


Protein 
abundance 
copies/ 
cell) 


mRNA 
abundance 
(copieVcell) 


Codon 
bias 


61,902 


6.21 


101 


PDC5 


4.3 


NA*^ 


0.828 


62,266 


6.19 


16 


ICLl 


20.1 


NA*^ 


0.327 


62,862 


8.02 


19 


ILV3 


5.3 


4.5 


0.548 


63,082 


6.40 


119 


PGM2 


2.2 


3.0 


0.402 


64,335 


5.77 


5 


PABl 


30.4 


1.5 


0.616 


66,120 


5.42 


8 


STIl 


6.7 


0.7 


0.313 


66,120 


5.42 


9 


STIl 


6.4 


0.7 


0.313 


66,450 


5,29 


141 


SSB2 


7.0 


NA'^ 


0.880 


66,450 


5.29 


142 


SSB2 


2.3 


NA-^ 


0.880 


66,456 


5.23 


10 


SSBl 


64.5 


79.5 


0.907 


66,456 


5.23 


11 


SSBl 


59.0 


79.5 


0.907 


66,456 


5.23 


12 


SSBl 


13.7 


79.5 


0.907 


68,397 


5.82 


82 


LEU4 


3.1 


3.0 


0.407 


69,313 


4.90 


13 


SSA2 


24.3 


18.6 


0.892 


69,313 


4.90 


14 


SSA2 


77.1 


18.6 


0.892 


74,378 


8.46 


15 


YKL029C 


2.8 


3.7 


0.353 


75,396 


5.82 


6 


GRSl 


5.5 


7.4 


0.500 


85,720 


6.25 


1 


MET6 


2.0 


NA^ 


0,772 


85,720 


6.25 


2 


MET6 


10.9 


NA" 


0.772 


85,720 


6.25 


3 


MET6 


1.4 


NA" 


0,772 


93,276 


6,11 


131 


EFTl 


17.9 


41.6 


0.890 


93,276 


6.11 


132 


EFTl 


5.7 


41.6 


0.890 


102,064" 


6.6r 


94 


ADE3 


4.8 


5.2 


0.423 


107.482" 


5.33^ 


4 


MCM3 


2.7 


NA*^ 


0.240 



" YPD gene names are available from the YPD website (39). 
* NA, calculation could not be performed or was not available. 
mRNA data inconclusive or NA. 

No methionines in predicted ORF; therefore, protein concentration was not 
determined. 

Measured molecular weight or pi did not match theoretical molecular weight 
or pi. 



Protein quantitation. [^^S]mcthioninc-labeled gels were exposed to X-ray film 
overnight, and then the silver stain and film were used to excise 156 spots of 
varying intensities, molecular weights, and pis. The excised spots were placed in 
G.6-nil microcentrifuge tubes, and scintillation cocktail (100 ^1) was added. The 
samples were vortexed and counted. In addition, two parallel gels were electro- 
blotted to polyvinylidene difluoride membranes. The membranes were exposed 
to X-ray film, and four intense single spots were excised from each membrane 
and subjected to amino acid analysis. For these four spots, a mean of 209 ± 4 
cpm/pmol of protein/me thionine was found. This number was used to quantitate 
all remaining spots in conjunction with the number of methionines present in the 
protein. 

To ensure that proteins were labeled to equUibrium, parallel 2D gels were 
prepared and run on yeast metabolically labeled for 1, 2, 6, or 18 h. The 
corresponding 156 spots were excised from each gel, and radioactivity was mea- 
sured by liquid scintillation counting for each spot. Calculated protein levels were 
highly reproducible for all time points measured after 1 h. 

Calculation of codoo bias and predicted half-life. Codon bias values were 
extracted from the YPD spreadsheet (17). Protein half-lives were calculated 
based on the N-end rule (33). When the N-terminal processing was not known 
experimentally, it was predicted based on the affinity of methionine aminopep- 
tidase (31). 

RESULTS 

Characteristics of proteome approach. Nearly eveiy facet of 
proteome analysis hinges on the unambiguous identification of 
large numbers of expressed proteins in cells. Several tech- 
niques have been described previously for the identification of 
proteins separated by 2DE, including N-terminal and internal 
sequencing (1, 2), amino acid analysis (38), and more recently 
mass spectrometry (25). We utilized techniques based on mass 
spectrometry because they afford the highest levels of sensitiv- 
ity and provide unambiguous identification. The specific pro- 
cedure used is schematically illustrated in Fig. 1 and is based 
on three principles. First, proteins are removed from the gel by 



proteolytic in-gel digestion, and the resulting peptides are sep- 
arated by on-line capillary high-performance liquid chromatog- 
raphy. Second, the eluting peptides are ionized and detected, and 
the specific peptide ions are selected and fragmented by the 
mass spectrometer. To achieve this, the mass spectrometer 
switches between the MS mode (for peptide mass identifica- 
tion) and the MS/MS mode (for peptide characterization and 
sequencing). Selected peptides are fragmented by a process 
called collision-induced dissociation (CID) to generate a tan- 
dem mass spectrum (MS/MS spectrum) that contains the pep- 
tide sequence information. Third, individual CID mass spectra 
are then compared by computer algorithms to predicted spec- 
tra from a sequence database. This results in the identification 
of the peptide and, by association, the protein(s) in the spot. 
Unambiguous protein identification is attained in a single anal- 
ysis by the detection of multiple peptides derived from the 
same protein. 

Protein identification. Yeast total cell protein lysate (40 jxg), 
metabolically labeled with [^^S]methionine, was electro- 
phoreticaUy separated by isoelectric focusing in the first dimen- 
sion and by SDS-10% polyaciylamide gel electrophoresis in 
the second dimension. Proteins were visualized by silver stain- 
ing and by autoradiography. Of the more than 1,000 proteins 
visible by silver staining, 156 spots were excised from the gel 
and subjected to in-gel tryptic digestion, and the resulting 
peptides were analyzed and identified by microspray LC- 
MS/MS techniques as described above. The proteins in this 
study were all identified automatically by computer software 
with no human interpretation of mass spectra. They are indi- 
cated in Fig. 2 and detailed in Table 1. 

The CID spectra shown in Fig. 3 indicate that the quality of 
the identification data generated was suitable for unambiguous 
protein identification. The spectra represent the amino acid 
sequences of tryptic peptides NSGDIVNLGSIAGR (Fig. 3A) 
and FAVGAFTDSLR (Fig. SB). Both peptides were derived 
from protein S57593 (hypothetical protein YMR226C), which 
migrated to spot 114 (molecular weight, 29,156; pi, 6.59) in the 
2D gel in Fig. 2. Five other peptides from the same analysis 
were also computer matched to the same protein sequence. 

Protein and mRNA quantitation. For the 156 genes investi- 
gated, the protein expression levels ranged from 2,200 (PGM2) 
to 863,000 (TDH2/TDH3) copies/cell. The levels of mRNA for 
each of the genes identified were calculated from SAGE fre- 
quency tables (35). These tables contain the mRNA levels for 
4,665 genes in yeast strain YPH499 grown to mid-log phase in 
YPD medium on glucose as a carbon source. In some in- 
stances, the mRNA levels could not be calculated for reasons 
stated in Materials and Methods. For the proteins analyzed in 
this study, mean transcript levels varied from 0.7 to 473 copies/ 
cell. 

Selection of the sample population for mRNA-protein ex- 
pression level correlation. The protein spots selected for iden- 
tification were selected from spots visible by silver staining in 
the 2D gel. An attempt was made not to include spots where 
overlap with other spots was readily apparent. The number of 
proteins identified was 156 (Table 1). Some proteins migrated 
to more than one spot (presumably due to differentia] protein 
processing or modifications), and protein levels from these 
spots were calculated by integrating the intensities of the dif- 
ferent spots. The 156 protein spots analyzed represented the 
products of 128 different genes. Genes were excluded from the 
correlation analysis only if part of the data set was missing; i.e., 
genes were excluded if (i) no mRNA expression data were 
available for the protein or putative SAGE tags were ambig- 
uous, (ii) the amino acid sequence did not contain methionine, 
(iii) more than a single protein was conclusively identified as 
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FIG. 3. Tandem mass (MS/MS) spectra resulting from analysis of a single spot on a 2D gel. The first quadrupole selected a single mass-to-charge ratio (m/z) of 687.2 
(A) or 592.6 (B), while the collision cell was filled with argon gas, and a voltage which caused the peptide to undergo fragmentation by CID was applied. The third 
quadrupole scanned the mass range from 50 to 1,400 m/z. The computer program Sequest (8) was utilized to match MS/MS spectra to amino acid sequence by database 
searching. Both spectra matched peptides from the same protein, S57593 (yeast hypothetical protein YMR226C). Five other peptides from the same analysis were 
matched to the same protein. 



migrating to the same gel spot, or (iv) the theoretical and 
observed pis and molecular weights could not be reconciled. 
After these criteria were applied, the number of genes used in 
the correlation analysis was 106. 



Codon bias and predicted half-lives. Codon bias is thought 
to be an indicator of protein expression, with highly expressed 
proteins having large codon bias values. The codon bias distri- 
bution for the entire set of more than 6,000 predicted yeast 
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gene ORFs is presented in Fig. 4A. The interval with the 
largest frequency of genes is between the codon bias values of 
0.0 and 0.1. This segment contains more than 2,500 genes. The 
distribution of the codon bias values of the 128 different genes 
found in this study (all protein spots from Fig. 2) is shown in 
Fig. 4B, and protein half-lives (predicted from applying the 
N-end rule [33] to the experimentally determined or predicted 
protein N termini) are shown in Fig. 4C. No genes were iden- 
tified with codon bias values less than 0.1 even though thou- 
sands of genes exist in this category. In addition, nearly all of 
the proteins identified had long predicted half-lives (greater 
than 30 h). 

Correlation of mRNA and protein expression levels. The 

correlation between mRNA and protein levels of the genes 
selected as described above is shown in Fig. 5. For the entire 
group (106 genes) for which a complete data set was gener- 
ated, there was a general trend of increased protein levels 
resulting from increased mRNA levels. The Pearson product 
moment correlation coefficient for the whole data set (106 
genes) was 0.935. This number is highly biased by a small 
number of genes with very large protein and message levels. A 
more representative subset of the data is shown in the inset of 
Fig, 5. It shows genes for which the message level was below 10 
copies/cell and includes 69% (73 of 106 genes) of the data used 
in the study. The Pearson product moment correlation coeffi- 
cient for this data set was only 0.356. We also found that levels 
of protein expression coded for by mRNA with comparable 
abundance varied by as much as 30-fold and that the mRNA 
levels coding for proteins with comparable expression levels 
varied by as much as 20-fold. 

The distortion of the correlation value induced by the un- 
even distribution of the data points along the x axis is further 
demonstrated by the analysis in Fig. 6. The 106 samples in- 
cluded in the study were ranked by protein abundance, and the 
Pearson product moment correlation coefficient was repeat- 
edly calculated after including progressively more, and higher- 
abundance, proteins in each calculation. The correlation values 
remained relatively stable in the range of 0.1 to 0.4 if the 
lowest-expressed 40 to 95 proteins used in this study were 
included. However, the correlation value steadily climbed by 
the inclusion of each of the 11 very highly expressed proteins. 

Correlation of protein and mRNA expression levels with 
codon bias. Codon bias is the propensity for a gene to utilize 
the same codon to encode an amino acid even though other 
codons would insert the identical amino add in the growing 
polypeptide sequence. It is further thought that highly ex- 
pressed proteins have large codon biases (3). To assess the 
value of codon bias for predicting mRNA and protein levels in 
exponentially growing yeast cells, we plotted the two experi- 
mental sets of data versus the codon bias (Fig. 7). The distri- 
bution patterns for both mRNA and protein levels with respect 
to codon bias were highly similar. There was high variability in 
the data within the codon bias range of 0.8 to 1.0. Although a 
large codon bias generally resulted in higher protein and mes- 
sage expression levels, codon bias did not appear to be predic- 
tive of either protein levels or mRNA levels in the cell. 

DISCUSSION 

The desired end point for the description of a biological 
system is not the analysis of mRNA transcript levels alone but 
also the accurate measurement of protein expression levels and 
their respective activities. Quantitative analysis of global 
mRNA levels currentiy is a preferred method for the analysis 
of the state of cells and tissues (11). Several methods which 
either provide absolute mRNA abundance (34, 35) or relative 
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mRNA levels in comparative analyses (20, 27) have been de- 
scribed elsewhere. The techniques are fast and exquisitely sen- 
sitive and can provide mRNA abundance for potentially any 
expressed gene. Measured mRNA levels are often implicitly or 
explicitly extrapolated to indicate the levels of activity of the 
corresponding protein in the cell. Quantitative analysis of pro- 
tein expression levels (proteome analysis) is much more time- 
consuming because proteins are analyzed sequentially one by 
one and is not general because analyses are limited to the 
relatively highly expressed proteins. Proteome analysis does, 
however, provide types of data that are of critical importance 
for the description of the state of a biological system and that 
are not readily apparent from the sequence and the level of 
expression of the mRNA transcript. This study attempts to 
examine the relationship between mRNA and protein expres- 
sion levels for a large number of expressed genes in cells 
representing the same state. 

Limits in the sensitivity of current protein analysis technol- 
ogy precluded a completely random sampling of yeast proteins. 
We therefore based the study on those proteins visible by silver 
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Staining on a 2D gel. Of the more than 1,000 visible spots, 156 
were chosen to include the entire range of molecular weights, 
isoelectric focusing points, and staining intensities displayed on 
the 2D protein pattern. The genes identified in this study 
shared a number of properties. First, all of the proteins in this 
study had a codon bias of greater than 0.1 and 93% were 
greater than 0.2 (Fig. 4B). Second, with few exceptions, the 
proteins in this study had long predicted half-lives according to 
the N-end rule (Fig. 4C). Third, low-abundance proteins with 
regulatory functions such as transcription factors or protein 
kinases were not identified. 

Because the population of proteins used in this study ap- 
pears to be fairly homogeneous with respect to predicted half- 
life and codon bias, it might be expected that the correlation of 
the mRNA and protein expression levels would be stronger for 
this population than for a random sample of yeast proteins. We 
tested this assumption by evaluating the correlation value if 
different subsets of the available data were included in the 
calculation. The 106 proteins were ranked from lowest to high- 
est protein expression level, and the trend in the correlation 
value was evaluated by progressively including more of the 
higher-abundance proteins in the calculation (Fig. 6). The cor- 
relation value when only the lower-abundance 40 to 93 pro- 
teins were examined was consistently between 0.1 and 0.4. If 
the 11 most abundant proteins were included, the correlation 
steadily increased to 0.94. We therefore expect that the corre- 
lation for all yeast proteins or for a random selection would be 
less than 0.4. The observed level of correlation between 
mRNA and protein expression levels suggests the importance 



of posttranslational mechanisms controlling gene expression. 
Such mechanisms include translational control (15) and con- 
trol of protein half-life (33). Since these mechanisms are also 
active in higher eukaryotic cells, we speculate that there is no 
predictive correlation between steady-state levels of mRNA 
and those of protein in mammalian cells. 

Like other large-scale analyses, the present study has several 
potential sources of error related to the methods used to de- 
termine mRNA and protein expression levels. The mRNA 
levels were calculated from frequenqr tables of SAGE data. 
This method is highly quantitative because it is based on actual 
sequencing of unique tags from each gene, and the number of 
times that a tag is represented is proportional to the number of 
mRNA molecules for a specific gene. This method has some 
limitations including the following: (i) the magnitude of the 
error in the measurement of mRNA levels is inversely propor- 
tional to the mRNA levels, (ii) SAGE tags from highly similar 
genes may not be distinguished and therefore are summed, (iii) 
some SAGE tags are from sequences in the 3' untranslated 
region of the transcript, (iv) incomplete cleavage at the SAGE 
tag site by the restriction enzyme can result in two tags repre- 
senting one mRNA, and (v) some transcripts actually do not 
generate a SAGE tag (34, 35). 

For the SAGE method, the error associated with a value 
increases with a decreasing number of transcripts per cell. The 
conclusions drawn from this study are dependent on the qual- 
ity of the mRNA levels from previously published data (35). 
Since more than 65% of the mRNA levels included in this 
study were calculated to 10 copies/cell or less (40% were less 
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than 4 copies/cell), the error associated with these values may 
be quite large. The mRNA levels were calculated from more 
than 20,000 transcripts. Assuming that the estimate of 15,000 
mRNA molecules per cell is correct (16), this would mean that 
mRNA transcripts present at only a single copy per cell would 
be detected 72% of the time (35). The mRNA levels for each 
gene were carefully scrutinized, and only mRNA levels for 
which a high degree of confidence existed were included in the 
correlation value. 

Protein abundance was determined by metabolic radiolabel- 
ing with p^SJmethionine. The calculation required knowledge 
of three variables: the number of methionines in the mature 
protein, the radioactivity contained in the protein, and the 
specific activity of the radiolabel normalized per methionine. 
The number of methionines per protein was determined from 
the amino acid sequence of the proteins identified by tandem 
mass spectrometry. For some proteins, it was not known 
whether the methionine of the nascent polypeptide was pro< 
cessed away. The N termini of those proteins were predicted 
based on the specificity of methionine aminopeptidase (31). If 
the N-terminal processing did not conform to the predicted 
specificity of processing enzymes, the calculation of the num- 
ber of methionines would be affected. This discrepancy would 
affect most the quantitation of a protein with a very low mmi- 
ber of methionines. The average number of calculated methi- 
onines per protein in this study was 7.2. We therefore expect 
the potential for erroneous protein quantitation due to un- 
usual N-terminal processing to be small. 



The amount of radioactivity contained in a single spot might 
be the sum of the radioactivity of comigrating proteins. Be- 
cause protein identification was based on tandem mass spec- 
trometric techniques, comigrating proteins could be identified. 
However, comigrating proteins were rarely detected in this 
study, most likely because relatively small amounts of total 
protein (40 jig) were initially loaded onto the gels, which re- 
sulted in highly focused spots containing generally 1 to 25 ng of 
protein. Because of the relatively small amount loaded, the 
concentrations of any potentially comigrating protein would 
likely be below the limit of detection of the mass spectrometry 
technique used in this study (1 to 5 ng) and below the limit of 
visualization by silver staining (1 to 5 ng). In the overwhelming 
majority of the samples analyzed, numerous peptides from a 
single protein were detected. It is assumed that any comigrat- 
ing proteins were at levels too low to be detected and that their 
influence in the calculation would be small. 

The specific activity of the radiolabel was determined by 
relating the precise amount of protein present in selected spots 
of a parallel gel, as determined by quantitative amino acid 
composition analysis, to the number of methionines present in 
the sequence of those proteins and the radioactivity deter- 
mined by liquid scintillation counting. It is possible that the 
resulting number might be influenced by unavoidable losses 
inherent in the amino acid analysis procedure applied. Because 
four different proteins were utilized in the calculation and the 
experiment was done in duplicate, the specific activity calcu- 
lated is thought to be highly accurate. Indeed, the specific 
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activities calculated for each of the four proteins varied by less 
than 10%. Any inconsistencies in the calculation of the specific 
activity would result in differences in the absolute levels calcu- 
lated but not in the relative numbers and would therefore not 
influence the correlation value determined. 

The protein quantitative method used eliminates a number 
of potential errors inherent in previous methods for the quan- 
titation of proteins separated by 2DE, such as preferential 
protein staining and bias caused by inequalities in the number 
of radiolabeled residues per protein. Any 2D gel-based method 
of quantitation is complicated by the fact that in some cases the 
translation products of the same mRNA migrated to different 
spots. One major reason is posttranslational modification or 
processing of the protein. Also, artifactual proteolysis during 
cell lysis and sample preparation can lead to multiple resolved 
forms of the protein. In such cases, the protein levels of spots 
coded for by the same mRNA were pooled. In addition, the 
existence of other spots coded for by the same mRNA that 
were not analyzed by mass spectrometry or that were below the 
limit of detection for silver staining cannot be ruled out. How- 
ever, since this study is based on a class of highly expressed 
proteins, the presence of undetected minor spots below silver 
staining sensitivity corresponding to a protein analyzed in the 
study would generally cause a relatively small error in protein 
quantitation. 

Codon bias is a measure of the propensity of an organism to 
selectively utilize certain codons which result in the incorpo- 
ration of the same amino acid residue in a growing polypeptide 
chain. There are 61 possible codons that code for 20 amino 
acids. The larger the codon bias value, the smaller the number 
of codons that are used to encode the protein (19). It is 



thought that codon bias is a measure of protein abundance 
because highly expressed proteins generally have large codon 
bias values (3, 13). 

Nearly all of the most highly expressed proteins had codon 
bias values of greater than 0.8. However, we detected a number 
of genes with high codon bias and relative low protein abun- 
dance (Fig. 7). For example, the expressed gene with both the 
second largest protein and mRNA levels in the study was 
EN02_YEAST (775,000 and 289.1 copies/cell, respectively). 
EN01_YEAST was also present in the gel at much lower 
protein and mRNA levels (44,200 and 0.7 copies/ceU, respec- 
tively). The codon bias values for EN02 and ENOl are similar 
(0.96 and 0.93, respectively), but the eT^ression of the two 
genes is differentially regulated. Specificalty, EN01_YEAST is 
glucose repressed (6) and was therefore present in low abun- 
dance under the conditions used. Other genes with large codon 
bias values that were not of high protein abundance in the gel 
include EFTl, TIFl, HXK2, GSPl, EGD2, SHM2, and TALI. 
We conclude that merely determining the codon bias of a gene 
is not suflicient to predict its protein expression level. 

Interestingly, codon bias appears to be an excellent indicator 
of the boundaries of current 2D gel proteome analysis tech- 
nology. There are thousands of genes with expressed mRNA 
and likely expressed protein with codon bias values less than 
0.1 (Fig. 4A). In this study, we detected none of them, and only 
a very small percentage of the genes detected in this study had 
codon bias values between 0.1 and 0.2 (Fig. 4B). Indeed, in 
every examined yeast proteome study (5, 7, 13, 28) where the 
combined total number of identified proteins is 300 to 400, this 
same observation is true. It is expected that for the more 
complex cells of higher eukaryotic organisms the detection of 
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low-abundance proteins would be even more challenging than 
for yeast. This indicates that highly abundant, long-lived pro- 
teins are overwhelmingly detected in proteome studies. If pro- 
teome analysis is to provide truly meaningful information 
about cellular processes, it must be able to penetrate to the 
level of regulatory proteins, including transcription factors and 
protein kinases. A promising approach is the use of narrow- 
range focusing gels with immobilized pH gradients (IPG) (23). 
This would allow for the loading of significantly more protein 
per pH unit covered and also provide increased resolution of 
proteins with similar electrophoretic mobilities. A standard pH 
gradient in an isoelectric focusing gel covers a 7-pH-unit range 
(pH 3 to 10) over 18 cm. A narrow-range focusing gel might 
expand the range to 0.5 pH units over 18 cm or more. This 
could potentially increase by more than 10-fold the number of 
proteins that can be detected. Clearly, current proteome tech- 
nology is incapable of analyzing low- abundance regulatory pro- 
teins without employing an enrichment method for relatively 
low-abundance proteins. In conclusion, this study examined 
the relationship between yeast protein and message levels and 
revealed that transcript levels provide little predictive value 
with respect to the extent of protein expression. 
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ABSTRACT 

Recent evidence suggests that many members of the human kallikrein 
(KLK) gene family are differentiaUy regulated in ovarian cancer and have 
potential as diagnostic and/or prognostic markers. We used the serial 
analysis of gene expression and expressed sequence tag databases of the 
Cancer Genome Anatomy Project to perform in sUico analyses of the 
expression pattern of the 15 human KLK genes in normal and cancerous 
ovarian tissues and cell lines. We found that seven KLK genes {KLKS, 
KLK6, KLK7, KLK8, KLKIO, KLKll, and KLK14) are up-regulated in 
ovarian cancer. Probing 2 normal and 10 ovarian cancer serial analysis of 
gene expression libraries with gene-specific tags for each KLK indicated 
that whereas no expression was detected in any normal libraries (with the 
exception of KLKIO and KLKll), these KLKs were found to be expressed 
with moderate densities (103-408 tags per million) in 40-60% of the 
ovarian cancer libraries analyzed. These data were verifled by screening 
the expressed sequence tag databases, where 78 of 79 mRNA clones 
isolated for these genes were from ovarian cancer libraries. X-profiler 
comparison of the pools of normal and cancerous ovaries identified a 
significant difference in expression levels for six of the seven KLKs. We 
experimentally verified the overexpression of six KLK proteins in cancer 
versus normal or benign tissues with highly sensitive and specific immun- 
ofluorometric assays. A statisticaUy significant stepwise increase in pro- 
tein levels was found among normal, benign, and cancerous ovarian 
tissues. The expression of five KLKs showed a strong degree of correlation 
at the protein level, suggesting the existence of a common mechanism or 
pathway that controls the expression of this group of adjacent genes 
during ovarian cancer progression. 

INTRODUCTION 

The CGAP^ is an international effort implemented by the National 
Cancer Institute to create a catalogue of the genes associated with 
cancer and to develop technological tools to support the analysis of 
the molecular profiles of cancer cells and their nonnal counterparts 
(1). Gene expression data contained in the electronic databases of the 
CGAP can be used to identify potentially informative marker genes 
expressed in cancer and to compare cDNA firequencies of genes of 
interest in normal versus cancer states. The project applies two main 
approaches, the EST method and the SAGE approach (2). Various 
analytical tools have also been developed for data analysis including 
BLAST, virtual Northem analysis, X-profiler, and digital differential 
display. These databases have been used successfully in recent years, 
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and results obtained from in silico analysis were experimentally 
verified in most cases. 

Epithelial ovarian cancer continues to be the leading cause of death 
from gynecological cancers in the United States, accounting for 
14,400 deaths/year (3). Because most patients are asymptomatic until 
the disease has metastasized, two-thirds are diagnosed in advanced 
stages, with survival rates of <20%. Currently, the only tumor marker 
that has a well-defmed and validated role in the management of 
ovarian cancer is CA125 (4), Serum CA125 has been evaluated for 
screening for ovarian cancer, differentiation between benign and 
malignant ovarian masses, and prognosis. This marker does not have 
a clear place in diagnosis, prognosis, or making treatment decisions, 
and it lacks specificity (5). Many potential new serum markers for 
ovarian cancer have been evaluated, either alone or in combination 
with CA125, including inhibin, prostasin, OVXl, LASA, CA15.3, 
and CA72-4 (4). However, none of them has a clear advantage over 
CA12S to date. Recently, microarray technology has been used to 
identify novel potential serum markers for ovarian cancer (6), and 
bioinformatic approaches have been successfully used to identify 
candidate genes for further analysis (7). 

KLKs are a family of 15 genes clustered together on chromosome 
19 and belong to the serine protease family of enzymes (8, 9). 
Prostate-specific antigen (PSA, hK3), a member of this family, is the 
premier tumor marker for prostate cancer (10). Accumulating evi- 
dence suggests a role for KLKs in cancer diagnosis and prognosis 
(11-15). 

In this study, we used bioinformatic tools to examine KLK gene 
expression in normal and cancerous ovarian tissues and cell lines. We 
found strong evidence that seven KLKs are up-regulated in ovarian 
cancer. We then experimentally verified the overexpression of six of 
these KLKs at the protein level in cancer tissues and found a signif- 
icant correlation between their concentration levels. 



MATERIALS AND METHODS 

VNBs. The mRNA sequences of the 15 human KLK genes were used to 
identify unique sequence tags of UniGene clusters for each KLK (for GenBank 
accession numbers, see Ref. 8). Two restriction digestion enzymes {NlaWl and 
Sau3A) were used as anchoring enzymes. These sequence tags were then used 
to determine the levels of expression of different KLKs in 2 normal and 10 
ovarian cancer libraries (4 from ovarian cancer tissues and 6 from ovarian 
cancer cell lines). Detailed information for these libraries is available from the 
web site of the CGAP.^ A list of all libraries used for analysis is submitted as 
supplementary information.^ Analyses were performed by comparing the pro- 
portion of libraries of each type (cancer versus nonnal) that show expression 
of each tag, in addition to the average expression densities in these libraries. 
mRNA sequences from the Human Genome Project were used as reference 
sequences. If more than one tag of the same gene appears in the same library, 
we only included the one with the peak level of expression (maximum tpm); 
the other tags were excluded to avoid inaccurate estimation of expression. 
Expression levels are displayed as blots with different densities and corrected 
as tpm to facilitate comparison. 



http://www.ncbi.nlm.nih.gov/ncicgap/. 
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EST Analysis. The full-length mRNA sequence of each KLK was com- 
pared aj;ainst human EST databases of the National Center for Biotech- 
nology Information. At the time of the study, these databases included libraries 
from 9 normal, 3 premalignant, and 17 cancerous tissues. Expression was 
calculated for each KLK as the number of positive libraries in each tissue type, 
in addition to the total number of clones detected in each type of library. 

X-proffler. X-profiler analyses of KLK gene expression were performed 
by comparing normal and cancerous ovarian libraries available in the SAGE 
databases. Because expression levels of various KLKs might be different from 
one cell line to another and in different types of ovarian cancer, we compared 
a pool of normal ovarian libraries (2 normal ovarian epithelial cell lines) 
against 10 different epithelial ovarian cancer libraries (4 from ovarian cancer 
tissues and 6 from ovarian cancer cell lines). Ovarian cancer libraries repre- 
sented different histological types including serous adenocarcinoma and clear 
cell carcinoma. The X-profiler cutoff value was set at 2-fold difference. 

Ovarian Tissue Cytosols. Human tissue extracts were prepared from 10 
histologically confirmed normal ovarian tissues, 10 benign ovarian tumors, and 
20 cancerous ovarian tissues. Tissue extraction was as follows: frozen human 
tissues (0.2 g) were pulverized on dry ice to fine powders. Two ml of 
extraction buffer [50 mM Tris-HCl buffer (pH 8.0) containing 150 mM NaCl, 
5 mM EDTA, and 1% NP40 surfactant] were added to the tissue powders, and 
the mixture was incubated on ice for 30 min with repeated shaking and vortex 
mixing every 10 min. Mixtures were centrifiiged at 14,000 X g at 4°C for 30 
min. The supematants representing the tissue extracts were collected and 
stored at -80°C until use. Our procedures have been approved by the Institu- 
tional Review Board of Mount Sinai Hospital. 

Immunofluorometric Assays. KLK protein levels were measured by 
ELISA-type time-resolved immunofluorometric assays developed in our lab- 
oratory. These sandwich-type immunoassays use monoclonal and polyclonal 
antibodies developed against recombinant proteins. All assays were shown to 
be specific, with no cross-reactivity from other KLKs (12, 16, 17). The protein 
level of each KLK was normalized according to the total protein content of 
each sample and expressed as ng KLK/mg total protein. 

Statistical Analysis. Correlation analyses were performed using Spearman 
correlation coefficient. The analysis of differences between the variables in the 
three groups was performed with the Kruskal-Wallis test. 

RESULTS 

First, to verify the reliability and sensitivity of the bioinformatic 
analysis, we compared the in silico pattern of expression of each KLK 
in normal tissues to the previously published experimental data on 
tissue KLK expression (8). Results obtained were concordant with the 
published data. For example, most EST and SAGE clones for KLK2, 
KLKS, and KLK4 were found in the prostate, whereas KLKS was 
highly expressed in normal breast tissue and skin (data not shown). 
Also, for those KLKs that show differential expression in ovarian 
cancer (see below), levels of expression were about the same between 
normal and cancer tissues for other malignancies, further verifying the 
accuracy of our results. 

Tabic I In silico analysis of KLK gene expression in normal and ovarian cancer 
tissues using the SAGE databases 



KLK Library Type Positivity" Average density* 



KLKS 


Normal ovary 


0/2 


0 




Ovarian cancer 


5/10 


177 


KLK6 


Normal ovary 


0/2 


0 




Ovarian cancer 


5/10 


262 


KLK7 


Normal ovary 


0/2 


0 




Ovarian cancer 


4/10 


103 


KLKS 


Normal ovary 


0/2 


0 




Ovarian cancer 


6/10 


192 


KLKIO 


Nonnal ovary 


1/2 


20 




Ovarian cancer 


4/10 


179 


KLKU 


Normal ovary 


1/2 


20 




Ovarian cancer 


4/10 


408 


KLK14 


Normal ovary 


0/2 


0 




Ovarian cancer 


6/10 


192 



" Number of positive libraries out of the total examined. 
Expression densities corrected as number of tpm. 



Table 2 In silico analysis of KLK gene expression in libraries from normal. 



premalignant, and ovarian cancer tissues as determined by EST database 



KLK 


Library T\pe 


Positiviiy 


No. of clones 


KLKS 


Normal ovary 


0/9 


0 




Premalignant 


0/3 


0 




Ovarian cancer 


5/17 


16 


KLK6 


Nonnal ovary 


0/9 


0 




Premalignant 


0/3 


0 




Ovarian cancer 


1/17 


21 


KLK7 


Normal ovary 


1/9 


1 




Premalignant 


0/3 


0 




Ovarian cancer 


4/17 


5 


KLKS 


Normal ovary 


0/9 


0 




Premalignant 


0/3 


0 




Ovarian cancer 


4/17 


5 


KLK JO 


Normal ovary 


0/9 


0 




Premalignant 


0/3 


0 




Ovarian cancer 


3/17 


20 


KLKU 


Normal ovary 


0/9 


0 




Premalignant 


0/3 


0 




Ovarian cancer 


3/17 


8 


KLKI4 


Normal ovary 


0/9 


0 




Premalignant 


0/3 


0 




Ovarian cancer 


1/17 


3 



Of the 15 KLK genes analyzed in this study, 7 genes {KLK5-8, 
KLKIO, KLKU, and KLK 1 4) were found to be up-regulated in ovarian 
cancer tissues and cell lines, compared with normal ovary. Probing all 
ovarian libraries from different sources (cancerous and normal fix)m 
tissues and cell lines) with iCL^J-specific sequences revealed that 
KLKS is not detectable in normal ovary as determined by SAGE 
analysis (Table 1) and EST library screening (Table 2). These data are 
consistent with previotis experimental reports showing no expression 
of KLKS in normal ovary (18). However, VNB analysis showed that 
KLKS is expressed in 5 of 10 ovarian cancer libraries with an average 
blot density of 177 tpm (Table 1). These data were furflier verified by 
screening the EST databases using the BLAST search engine; KLKS 
was expressed in 5 of 17 ovarian cancer EST libraries, whereas 16 
KLKS mRNA clones were isolated, compared with no clones in 
libraries from normal or premalignant lesions (Table 2). X-profiler 
analysis showed a significant difference when comparing KLKS gene 
expression between normal and malignant pools of ovarian libraries 
(Table 3). 

KLK6 was originally cloned as a gene that is highly expressed in 
ovarian cancer but not in normal ovaries (19). This finding was later 
confirmed by analyzing KLK6 expression patterns in normal tissues 
by PCR (20) and by immunohistochemistry (21). In silico analysis in 
the current study showed that whereas KLK6 was not detectable by 
VNB in normal ovary, it was detected in 5 of 10 ovarian cancer SAGE 
libraries with a relatively high blot density (262 tpm; Table 1). In 
addition, 21 ovarian cancer EST clones were detected, compared with 
no clones for normal or premalignant ovarian tissues and cell lines 
(Table 2). X-profiler analysis further indicated a significant difference 
of KLK6 expression (expression factor, 0.953) when comparing nor- 
mal and malignant pools of ovarian libraries (Table 3). 

KLKS was also found to be expressed at a fairly decent density 
(average, 192 tpm) in 6 of 10 ovarian cancer libraries but not in 
hbraries from normal ovaries. Analysis of the EST database also 
showed no expression in any of the nine normal ovarian libraries 
screened, whereas five KLKS EST clones were detected in four 
ovarian cancer libraries (Table 2). This was ftirther confirmed by 
X-profiler analysis (Table 3). A similar overexpression pattern was 
observed for KLKIO, as shown m Tables 1-3. 

Weak KLKU expression was detected by SAGE analysis in normal 
ovarian libraries. However, expression density was about 20-fold 
higher in ovarian cancer tissues and cell lines (20 versus 408 tpm for 
normal and cancer, respectively; Table 1), In addition, all ESTs 
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Table 3 X-profiler analysis of KLK gene expression in normal versus cancerous 
ovarum tissues and ci II lines 



Average gene expression 



KLK 


Cancer* 


Normal^ 


Factor of gene expression" 


KLK5 


18 


0 


0.752 


KLK6 


36 


0 


0.9S3 


KLK7 


22 


0 


0.823 


KLK8 


23 


0 


0.838 


KLK 11 


17 


1 


0.557 


KLK14 


23 


0 


0.838 



"A pool of 10 ovarian cancer SAGE libraries. 
A pool of two normal ovarian SAGE libraries. 
A factor of 1 represents the highest statistical significance. 



detected for this gene were from cancer libraries, not normal libraries 
(Table 2). This was also confirmed by X-profiler analysis (Table 3). 
Whereas no KLK14 clones were detected in normal SAGE or EST 
libraries, mRNA tags were found in 6 out of 10 cancer libraries by 
VNB with an average density of 192 tpm, and 5 EST clones from an 
ovarian cancer library (Tables 1 and 2). 
No other KLKs were found to be differentially expressed in ovarian 



cancer by these methods. For instance, KLKS was found to be ex- 
pressed in 1 of 2 normal ovarian libraries compared with 3 out of 10 
ovarian cancer libraries, with an average density of 41 tpm in normal 
compared with 45 tpm in cancer (data not shown). 

Overeipression of KLK Proteins in Ovarian Cancer. To verify 
the in silico mRNA overexpression results and to examine whether 
these mRNA up-regulations reflect enzyme changes at the protein 
level, we compared protein expression levels of six up-regulated 
KLKs in normal, benign, and cancerous ovarian tissues (KLKI4 could 
not be examined due to lack of an ELISA assay for this KLK). As 
shown in Fig. 1 and Table 4, there is a stepwise increase in the 
expression levels of these KLKs among normal, benign, and malig- 
nant tissues. These differences were statistically significant by the 
Kruskal-WalUs test for hKli (P < 0.001), hK5 {P = 0,048), hK6 
(P = 0.044), and hKlO {P = 0.027) but not for hK7 and hK8 
(P > 0.05; Fig. 1). When comparing normal and cancer tissues, the 
median levels of expression increased between 2-fold (hK5) to ~ 80- 
fold (hKll; Table 4). 

By Spearman correlation, moderate to strong correlations were 
observed between the expression levels of five of these KLKs (hK5, 



Fig. 1, KLK protein expression in normal {N = 10), benign 
{N ~ 10), and cancerous {N = 20) ovarian tissues. KLK levels 
are expressed in ng/mg total protein. Ps were calculated by 
Kniskal-Wallis test. For discussion, see Results. 




Normal Benign Cancer 

2225 



Normal Benign Cancer 
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hK6, hK7, hK8, and hKlO), and the results were highly statistically 
sign;ticant (r, = 0.45-0.74 and P < 0.001; Table 5). hICl 1 appears to 
follovv a different expression pattern and does not correlate well with 
any of the other proteins (Table 5). 

DISCUSSION 

Our results show that 7 of the 15 human KLKs are up-regulated in 
ovarian cancer. The differential expression of certain KLKs in ovarian 
cancer has been documented previously (11, 22, 23). KLK4 has 
prognostic value in ovarian cancer patients (13, 24), and KLKS over- 
expression is an indicator of poor prognosis (25). We have also shown 
recently that hK6 (15), hKlO (26, 27), and hKll (12) are useful 
serological diagnostic markers for ovarian cancer because their serum 
levels are higher than normal in >50% of ovarian cancer patients. 
KLK6 was reported to be overexpressed in ovarian cancer, and it has 
been suggested to play a role in cancer invasion (19, 28). KLK7 and 
KLKS were also reported to be overexpressed in ovarian cancer 
patients compared with normal subjects (14, 28, 29). Also, KLK9 was 
shown to have prognostic value in ovarian cancer (30). 

It has been reported previously that four of the most centromeric 
KLKs {KLK2, KLK3, KLK4, and KLK15) are differentially expressed 
in prostate cancer and are androgen-regulated (8, 31). Here we pro- 
vide strong evidence that many members of the telomeric cluster of 
adjacent KLKs (KLKS -14) are overexpressed in ovarian cancer. This 
points out the possibility for the presence of a locus control region or 
another common regulatory mechanism for these KLKs. Because 
many of these genes are regulated by steroid hormones, especially 
estrogens (8), it will be interesting to investigate whether their up- 
regulation in cancer is a result of a hormonal derangement. 

We used two independent databases (EST and SAGE) to generate 
our results. We calculated the proportion of positive libraries from 
each tissue type, in addition to the density of expression. We did not 
attempt to quantify KLK gene expression from the EST databases 
because many EST libraries are normalized or subtracted, which could 
bias the results. It is further emphasized that in silico analysis data 
should ahvays be experimentally confirmed. Possible sources of bias 
include sequence errors, presence of mutations associated with certain 
malignancies, unequal presentation of cDNAs in different libraries, 
and expression of splice variants in certain malignancies. 

Lercher et al (32), during global analysis of > 11,000 genes in 14 
different tissues, selected a cutoff value for gene expression as being 
low (^37 tpm) and intermediate or high (^134 tpm). If this cutoff is 

Table 4 Descriptive statistics of protein levels of six KLKs in ovarian tissue extracts 



All values are in og kallikrein/mg total protein. 









hK5 


hK6 


hK7 


hKS 


hKlO 


hKll 


Normal tissue 


s(N = 


10) 














Mean 






0.041 


1.43 


0.50 


0.96 


0.66 


0.51 


SE 






0.006 


0.33 


0.16 


0.37 


0.18 


0.25 


Median 






0.05 


1.45 


0.34 


0.47 


0.59 


0.17 


Minimum 






O.Ol 


0.05 


0.00 


0.03 


0.08 


0.03 


Maximum 






0.08 


2.83 


1.56 


3.91 


2.11 


2,50 


Benign tissues 


iiN = 


10) 














Mean 






0.16 


2.37 


1.94 


3.99 


1.27 


4.75 


SE 






0.04 


0.74 


0.60 


1.76 


0.37 


1.49 


Median 






0.12 


1.60 


1.20 


0,99 


1.03 


4.01 


Minimum 






0.01 


0.29 


0.22 


0,09 


0.12 


0.25 


Maximum 






0.47 


8.43 


5.48 


14.80 


3.90 


12.76 


Cancer tissues 




20) 














Mean 






0.62 


9.0 


7.27 


5.80 


4.88 


14.65 


SE 






0.21 


2.5 


2.13 


1.38 


1.27 


1.49 


Median 






0.U 


3.15 


1.26 


4.63 


3.53 


13.81 


Minimum 






0.00 


0.05 


0.09 


0.00 


0.03 


1.92 


Maximum 






2.99 


35.03 


27.56 


17.49 


20.23 


27.52 



Table 5 Correlations between protein levels of six KLKs in ovarian tissue extracts" 







hK6 


hK7 


hK8 


hKlO 


hKll 


hK5 




0 53 






U.0 1 


U. 1 I 




p 


< 0.001 


< 0.001 


<0 001 


<Q QQI 


0 48 


hK6 






0.64 


0.52 


0.45 


0.34 




p 




<0.001 


0.001 


0.003 


0.034 


hK7 


'•s 






0.66 


0.59 


0.23 




p 






<0.001 


<0.001 


0.14 


hK8 










0.390 


0.17 




p 








0.013 


0.29 


hKlO 












0.22 




p 










0.15 



° Spearman correlation. 



applied to our data, we could conclude that whereas KLK gene 
expression is absent or low in normal ovarian tissues, medium to high 
levels of expression of 6 of the 15 KLKs are seen in ovarian cancer. 

Dong et al (13) and Obiezu et aL (24) previously reported higher 
levels of expression of KLK4 in ovarian cancer compared with normal 
ovarian tissues. In this study, no detectable expression of KLK4 was 
found in any normal or cancerous tissue, as determined by SAGE or 
EST analysis. This might reflect the fact that KLK4 is expressed in the 
ovaries at very low amoimts, so that the mRNA is not detectable, 
except by using a highly sensitive technique such as reverse transcript 
tion-PCR. 

Our current results, indicating KLK overexpression at the protein 
level in ovarian cancer, are in accord with previous observations of 
serum levels of a few of these KLKs in ovarian cancer. We have 
previously shown that serum levels of hK6 (15), hKlO (27), and hKl 1 
(12) are elevated in ovarian cancer patients. Similar data have been 
generated for hK5, hK7, and hK8.^ 

Several hypotheses can be proposed regarding the possible involve- 
ment of KLKs in ovarian cancer. Being serine proteases, they could be 
implicated in tumor progression through extracellular matrix degra- 
dation. Previous studies have shown that serine protease overexpres- 
sion is associated with unfavorable clinical prognosis of various forms 
of cancer (33). Ovarian cancer is a "hormonal" malignancy. KLKs, 
being under steroid hormone regulation (8), may represent down- 
stream targets through which hormones affect the initiation or pro- 
gression of ovarian cancer. The statistically significant positive cor- 
relations between expression levels five of six of these KLKs (Table 
5) and the parallel overexpression, as well as data presented elsewhere 
(22), are supportive of an emerging idea that a number of these 
enzymes are part of a cascade enzymatic pathway. Direct evidence is 
currently lacking. 

In conclusion, here we provide strong evidence suggesting that 
seven KLKs are up-regulated in ovarian cancer. These data were 
confirmed by examining several databases, in addition to protein 
analysis of KLKs in tissue cytosols. Further experimentation is nec- 
essary to establish the usefulness of these KLKs for diagnosis, prog- 
nosis, and treatment of ovarian cancer. 
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and Sotiropoulou, G. Immunofluorometric assay of human kallikrein 6 (zyme/prote- 
ase M/neurosin) and preliminary clinical applications. Clin. Biochem., 33: 369-375, 
2000. 

17. Luo, L. Y-, Grass, L., Howarth, D. J., Thibault, P., Ong. H., and Diamandis, E. P. 
Immunofluorometric assay of human kallikrein 10 and its identification in biological 
fluids and tissues. Clin. Chem., 47: 237-246, 2001. 

18. Yousef, G. M., and Diamandis, E. P. The new kallikrein-like gene, KLK-L2. Mo- 
lecular characterization, mapping, tissue expression, and hormonal regulation. J. Biol. 
Chem., 274: 37511-37516. 1999. 

19. Anisowicz, A., Sotiropoulou, G., Stenman, G., Mok, S. C, and Sager, R. A novel 
protease homolog difiTerentially expressed in breast and ovarian cancer. Mol. Med., 2: 
624-636, 1996. 

20. Yousef, G. M., Luo, L. Y., Scherer, S. W., Sotiropoulou, G., and Diamandis, E. P. 
Molecular characterization of zyme/protease M/neurosin (PRSS9), a honnonally 
regulated kallikrein-like serine protease. Genomics, 62: 251-2S9, 1999. 



21- Pciraki, C. D., Karavana, V. N., Skoufogiannis, P. T., Little, S. P., Howarth, D. J., 
Yousef, G. M., and Diamandis, E. P. The spectrum of human kallikrein 6 (zymc/ 
protease M/neurosin) expression in human tissues as assessed by immunohistochem- 
istry. J. Histochem. Cytochem., 49: 143 1-1441, 2001. 

22. Yousef, G. M., and Diamandis, E. P. Kallikreins, steroid hormones and ovarian 
cancer: is there a link? Minerva Endocrinol., 27: 157-166, 2002, 

23. Diamandis, E, P.. Yousef, G. M., Luo, L. Y., Magklara, A., and Obiezu, C. V. The 
new human kallikrein gene family: implications in carcinogenesis. Trends Endocri- 
nol. Metab., //; 54-60, 2000. 

24. Obiezu, C. V., Scorilas, A., Katsaros, D., Massobrio, M., Yousef, G. M., Fracchioli, 
S., Rigault De La Longrais, I. A., Arisio, R., and Diamandis, E. P. Higher human 
kallikrein gene 4 (klk4) expression indicates poor prognosis of ovarian cancer 
patients. Clin. Cancer Res., 7; 2380-2386, 2001. 

25. Kim, H., Scorilas, A., Katsaros, D., Yousef, G. M., Massobrio, M., Fracchioli, S., 
Piccinno, R., Gordini, G., and Diamandis, E. P. Human kallikrein gene 5 (KLK5) 
expression is an indicator of poor prognosis in ovarian cancer. Br. J. Cancer, 84: 
643-650, 2001. 

26. Luo, L. Y., Katsaros, D., Scorilas, A., Fracchioli, S., Piccinno, R., Rigault de la 
Longrais, I. A., Howarth, D. J., and Diamandis, E. P. Prognostic value of human 
kallikrein 10 expression in epithelial ovarian carcinoma. Clin. Cancer Res., 7: 
2372-2379, 2001. 

27. Luo, L., Bunting, P., Scorilas, A., and Diamandis, E. P. Human kallikrein 10: a novel 
tumor marker for ovarian carcinoma? Clin. Chim. Acta, 306: 1 1 1-1 18, 2001 . 

28. Tanimoto, H., Underwood, L. J., Shigemasa, K., Parmley, T. H., and 0*Brien. T. J. 
Increased expression of protease M in ovarian tumors. Tumour Biol., 22: 11-18, 
2001. 

29. Tanimoto, H., Underwood, L. J., Shigemasa, K., Yan Yan, M. S., Clarke, J., Parmley, 
T. H., and O'Brien. T. J. The stratum coraeum chymotryptic enzyme that mediates 
shedding and desquamation of skin cells is highly overexpressed in ovarian tumor 
cells. Cancer (Phila.), 86: 2074-2082, 1999. 

30. Yousef, G. M., Kyriakopoulou, L. G., Scorilas, A., Fracchioli, S., Ghiringhello, B., 
Zarghooni, M., Chang, A., Diamandis, M., Giardina, G., Hartwick, W. J., Richiardi, 
G,, Massobrio, M., Diamandis, E, P., and Katsaros, D. Quantitative expression of the 
human kallikrein gene 9 (KLK9) in ovarian cancer: a new independent and fiivorable 
prognostic marker. Cancer Res., 61: 781 1-7818, 2001. 
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kallikrein 2 (hK2) and prostate-specific antigen (PSA): two closely related, but 
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32. Lercher, M. J., Urrutia, A. O., and Hurst, L. D. Clustering of housekeeping genes 
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Commissioner for Pateiits 
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Alexandria, VA 22313-1450 



1. J. Christopher Grimaldi, decjare and state as follows: 

L I am a Senior Research Associate in the Molecular Biology Department of 
Genentech, Inc., South San Francisco, CA 94080. 

2. My scientific Curriculum Vitae, including my list of publications, is attached to 
and forms part of this Declaration (Exhibit A). 

3. I joined Genentech in January of 1999, From 1999 to 2003, 1 directed the Cloning 
Laboratory in the Molecular Biology Department. During this time I directed or performed 
niimerous molecular biology techniques including semi-quantitative Polymerase Chain Reaction 
(PCR) analyses. I am currently involved, among other projects, in the isolation of genes coding 
for membrane associated proteins which can be used as targets for antibody therapeutics against 
cancer. In connection with the above-identified patent application, I personally performed or 
directed the semi-quantitative PCR gene expression analyses in the assay entitled "Tumor Versus 
Normal Differential Tissue Expression Distribution," which is described in EXAMPLE 18 in. the 
specification. These studies were used to identify differences in gene expression between tumor 
tissue and their normal counterparts, 

4. EXAMPLE 18 reports the results of the PCR analyses conducted as part of Ae 
investigating of several newly discovered DNA sequences. This process included developing 
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primers and analyzing expression of flie DNA sequences of interest in normal and tumor tissues. 
The analyses were designed to determine whether a difference exists between gene expression in 
normal tissues as compared to tumor in the same tissue type. 

5. The DNA libraries used in the geiie expression studies were made from pooled 
samples of normal and of tumor tissues. Data from pooled samples is more likely to be accurate 
than data obtained from a sample from a single individual. That is, the detection of variations in 
gene expression is likely to represent a more generally relevant condition when pooled samples 
from normal tissues are compared witii pooled samples from tumors in the same tissue type. 

6. In differential gene expression studies, one looks for genes whose expression levels 
differ significantly under different conditions, for example, in normal versus diseased tissue. 
Thus, I conducted a semi-quantitative analysis of the expression of the DNA sequences of 
interest in normal versus tumor tissues. Expression levels were graded according to a scale of +, - 
, and +/- to iijdicate the anaoimt of flie specific signal detected. Using the widely accepted 
technique of PGR, it was determined whether the polynucleotides tested were more highly 
expressed, less expressed, or whether expression remained the same in tumor tissue as compared 
to its normal counterpart. Because this technique relies on the visual detection of ethidium 
bromide staining of PGR products on agarose gels, it is reasonable to assume that any detectable 
differences seen between two samples will represent at least a two fold difference in cDNA. 

7. The results of the gene expression studies indicate that the genes of interest can be 
used to differentiate tumor from normal. The precise levels of gene expression are irrelevant; 
what matters is that there is a relative difference in expression between normal tissue and tumor 
tissue. The precise type of tumor is also irrelevant; again, the assay was designed to indicate 
whether a difference exists between normal tissue and tumor tissue of the same type. If a 
difference is detected, this indicates that the gene and its corresponding polypeptide and 
antibodies against the polypeptide are useful for diagnostic purposes, to screen samples to 
differentiate between normal and tumor. Additional studies can then be conducted if further 
information is desired. 

8. I hereby declare that all statements made herein of my own knowledge are true and 
that all statements made on information or beUef are believed to be true, and further that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 1 8 of the United States 
Code and that such willful statements may jeopardize the validity of the application or any 
patent issued thereon. 
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EXHIBIT A 



J. Christopher Grimaldi 

1434-36*^ Ave. 

San Francisco, CA 94122 

(415) 68M639 (Home) 

EDUCATION University of California, Berkeley 

Bachelor of Arts in Molecular Biology, 1984 



EMPLOYMENT EXPERIENCE 



SRA Genentech Inc., South San Francisco; 1/99 to present 

Previously, was responsible to direct and manage the Cloning Lab. Currently focused on 
isolating cancer specific genes for the Tumor Antigen (TAP), and Secreted Tumor Protein 
(STOP) projects for the Oncology Department as well as Immunologically relevant genes for the 
Immunology Department. Directed a lab of 6 scientists focused on a company-wide team effort 
to identify and isolate secreted proteins for potential therapeutic use (SPDI). For the SPDI project 
my duties were, among other things, the critically important coordination of the cloning of 
thousands of putative genes, by developing a smooth process of communication between the 
Bioinformatics, Cloning, Sequencing, and Legal teams. Collaborated with several groups to 
discover novel genes through the Curagen project, a unique differential display methodology. 
Interacted extensively wilh the Legal team providing essential data needed for filing patents on 
novel genes discovered through the SPDI, TAP and Curagen projects. My group has developed, 
implemented and patented high throughput cloning methodologies fliat have proven to be 
essential for the isolation of hundreds of novel genes for the SPDI, TAP and Curagen projects as 
well as dozens of other smaller projects. 



Scientist DNAX Research Institute, Palo Alto; 9/91 to 1/99 

Involved in-multiple ppejeets aimed at understanding novel genes discovered through 
bioinformatics studies and functional assays. Developed and patented a method for the specific 
depletion of eosinophils in vivo using monoclonal antibodies. Developed and implemented 
essential technical methodologies and provided strategic direction in the areas of expression, 
cloning, protein purification, graeral molecular biology, and monoclonal antibody production. 
Trained and supervised num^ous technical.staff. 

Facilities 

Manager Corixa, Redwood City; 5/89 - 7/91. 

Directed plant-related activities, which included expansion planning, maintenance, safety, 
purchasing, inventory control, shipping and receiving, and laboratory management. Designed 
and implemented the safety program. Also served as liaison to regulatory agencies at the local, 
state and federal level. Was in charge of property leases, leasehold improvements, etc. 
Negotiated vendor contracts and directed the purchasing department Trained and supervised 
personnel to carry out the above-mentioned duties. 




o 



SRA 



University of California, San Francisco 
Cancer Research Institute; 2/81 -m9. 



Was responsible for nmnerous cloning projects including: studies of somatic hypermutation, 
studies of AIDS-associated lymphomas, and cloning of t(5; 14), t(l 1 ; 14), and t(8;14) 
translocations. Focused on the activation of hemopoietic growth factors involved in the t(5;l 
translocation in leukemia patients*. 



Worked on a subunit porcine vaccine directed against Mycoplasma hyopneumoniae. Was 
responsible for generating genomic libraries, sqreenmg with degenerate oligonucleotides, and 
characterizing and expressing clones m E. coli. Also constructed a general purpose expression 
vector for use by odier scientific teams. 

PUBLICATIONS 

1. Hilaiy F. Clark, et al. "The Secreted Protein Discovery Initiative (SPDI), a Large-scale 
Effort to Identify Novel Human Secreted and Transmembrane Proteins: a bioinformatics 
assessment** Genome Res/Vol 13(10), 2265-2270, 2003 

2. Sean H. Adams, Qarissa Chui' Sarah L. Schilbach, Xing Xian Yu, Audrey D. Goddard, J. 
Christopher Grimaldi, James Lee, Patrick Dowd. David A. Lewin, & Steven Colman'^BFTT, 
a Unique Acyl-CoA l^iioesterase Induced m Thermogenic Brown Adiopose Tissue: Cloning, 
organization of the humanb gene and assessment of a potential link to obesity" Biochemical 
Journal, Vol 360, 135-142. 2001 

3. Szeto W, Jiang W, Tice DA, Rubmfeld B, Hollmgshead PG, Pong SB, Dugger DL, Hiam T, 
Yansura D, Wong TA, Grimaldi JC, Corpuz RT, Singh JS, Frantz GD, , Devaux B, Crowley 
CW, SchwallRH, Eberhard DA,.Rastelli L, Polakis.P, and Rennica D. "Overexpression of 
the Retenoic Acid-Responsive Gene Stra6 m Human Cancers and its Synergistic Activation 
by Wnt-1 and Retinoic Acid." Cancer Research Vol. 61(10). 4197-4205, 2001 

4. Jeanne Kahn, Fuad Mehraban, Gladdys Ingle, Xiaohua Xin. Juliet B. Bryant, Gordon Vehar, 
Jill Schoenfeld, J. Christopher Grimaldi (incorrectly named as "Grimaldi, CP'), Frankhn 
Peale, Apama Draksharapu, David A. Lewin, and Mary E. Gerritsen. "Gene Expression 
Profiling in an in Vitro Model of Angiogenesis." American Journal of Pathology Vol 156(6). 
1887-1900,2000. 

5. Grimaldi JC, Yu NX, Grunig G, Seymour BW, Cottrez F, Robinson DS, Hosken N, Feriin 
WG, Wu X, Soto H, O'Garra A, Howard MC. Coffraan RL. "Depletion of eosinophils in 
mice through the use of antibodies specific for C-C chemokine receptor 3 (CCR3). Journal of 
Leukocyte Biology; Vol. 65(6), 846-53, 1999 



Research 
Technician 



Berlex Biosciences, South San Francisco; 7/85-2/87. 



6. Oliver AM. Grimaldi JC, Howard MC, Kearney JF. "Independently ligating CD38 and Fc 
ganunaRDB relays a dominant negative signal to B cells.*' Hybridoma Vol, 18(2), 1 13-9, 
1999 



7. Cockayne DA, Muchamuel T, Grimaldi JC, MuUer-Steffiier H, RandaU TD, Lund FE, 
Murray R, Schuber F, Howard MC. 'TMBce deficient for the ecto-nicotinamide adenine 
dinucleotide glycohydrolase CD38 exhibit altered humoral inunune responses." Blood Vol 

92(4), 1324-33, 1998 

8. Frances E. Lund, Nanette W. Solvason, Michael P. Cooke, Andrew W. Heath, J. Christopher 
Grimaldi, Troy D. Randall, R. M. Parkhouse, Christopher C Goodnow and Maureen C. 
Howard. "Signaling throu^ murine CD38 is impaired in antigen receptor unresponsive B 
cells." European Journal of Inununology, Vol. 25(5), 1338-1345, 1995 

9. M. J. Guimaraes, L F. Bazan, A. Zolotnifc, M. V. WUes, J. C. Grimaldi, F. Lee. T. 
McClanahan. "A new approach to the study of haematopoietic development in the yolk sac 
and embryoid body." Development, Vol. 121(10), 3335-3346, 1995 

10. L Christopher Grimaldi, Srirara Balasubramanian, J. Feraandp Bazan, Armen Shanafelt, 
Gerard Zurawski and Maureen Howard. "CD38-mediated protein ribosylation." Journal of 
Immunology,VoL 155(2), 811-817, 1995 

1 1. Leopoldo Santos-Argumedo, Frances R Lund, Andrew W, Heath. Nanette Solvason, Wei 
Wei Wu, J. Christopher Grimaldi, R. M, E. Parkhouse and Maureen Howard. '•CD38 
unresponsiveness of xid B cells implicates Braton's tyrosine kinase (btk) as a regulator of 
CD38 mduced signal transduction." International Immunology, Vol 7(2). 163-170, 1995 

12. FrancesXund, Nanette Solvason, J. Christopher Grimaldi, R. M. E. Parkhouse and Maureen 
Howard. "Murine CD38: An imanunoregulatoiy ectoenzyme." Immunology Today Vol 
16(10), 469-473, 1995 

13- Maureen Howard, J. Christopher Grimaldi, J. Fernando Bazan, Frances R Lund, Leopoldo 
Santos-Argumedo, R. M. E. Parkhouse, Timothy F. Walseth, and Hon Cheung Lee. 
"Formation and Hydrolysis of Cyclic ADP-Ribose Catalyzed by Lymphocyte Antigen 
CD38." Science, Vol. 262, 1056-1059, 1993 

14, NobuyukiJIacada, Leopoldo.Santos-Arguraedo, Ray Chang, J. Christopher Grimaldi, Frances 
Lund, Camilynn L Brannan, Neal G. Copeland, Nancy A. Jenkins, Andrew Heath, R. M, E. 
Parkhouse and Maureen Howard. "Expression Cloning of a cDNA Encoding a Novel Murine 
BCeU Activation Marker: Homology to Human CD38." Hie Journal of Immunology. Vol 
151,3111-3118,1993 ^ 

15, David J, Rawlings, Douglas C. Saffran, Satoshi Tsukada, David A. Largaespada, J. 
Christopher Grimaldi, Lucie Cohen Randolph N. Mohr, J. Fernando Bazan, Maureen 
Howard, Neal G, Copeland, Nancy A. Jenkins. Owen Witte. "Mutation of Unique Region of 
Bruton's Tyrosine Kinase in Immunodeficient XID Mice." Science, Vol. 261. 358-360, 1993 

16. J. Christopher Grimaldi, Raul Toires, Christine A. Kozak, Ray Chang, Edward Clark, 
Maureen Howard, and Debra A. Cockayne. "Genomic Structure and Chromosomal Mapping 
of the Murine CD40 Gene/' The Journal of Immunology, Vol 149, 3921-3926, 1992 

17. Tinaothy C. Meeker, Bmce Shiramizu, Lawrence Kaplan, Brian Hemdier. Henry Sanchez, J. 
Christopher Grimaldi, James Baumgartner, Jacab Rachlin. Ellen Feigal, Mark Rosenblum and 
Michael S. McGrath. "Evidence for Molecular Subtypes of HIV-Associated Lymphoma: 



Division into Peripheral Monoclonal, Polyclonal and Central Nervous 
AIDS, Vol. 5, 669-674, 1991 



18. Ann Grimaidi and Chris GrimaldL "Small-Scale Lambda DNA Prep." Contribution to 
Current Protocols in Molecular Biology, Supplement 5, Winter 1989 

19. J. Christopher Grimaidi, Timothy C. Meeker. "The t(5;14) Chromosomal Translocation m a 
Case of Acute Lymphocytic Leukemia Joins the Interleukin-3 Gene to the Immunoglobulin 
Heavy Chain Gene." Blood, Vol 73, 2081-2085, 1989 

20. Timothy C. Meeker, J. Christopher Grimaidi, et al. "An Additional Breakpoint Region in the 
BCL-1 Locus Associated with the t(ll;14) (ql3;q32) Translocation of B-Lymphocytic 
MaHgnancy." Blood, Vol. 74, 1801-1806, 1989 

21 Timothy C. Meeker, J. Christopher Grimaidi. Robert OHourke, ct al. "Lack of Detectable 
Somatic Hypermutation in the V .Region of the Ig H Chain Gene of a Human Chronic B 
Lymphocytic Leukemia." The Journal of Immunology, Vol. 141, 3994-3998, 1988 

MANUSCRIPTS IN PREPARATION 

1. Sriram Balasabramanian, J. Christopher Grimaidi. J. Fernando Bazan, Gerard Zurawski and 
Maureen Howard. "Structural and functional characterization of CD38: Identification of 
active site residues" 



PATENTS 

1 . "Methods for Eosinophil Depletion with Antibody to CCR3 Receptor" (US 6,207,155 B 1). 

2. "Amplification Based Qoning Method." (US 6,607,899) 

3. Ashkenazi et al., "Secreted and Transmembrane Polypeptides and Nucleic Acids Encoding 
the Same." (this patent cavers several hundred genes) ^ 

4. "IL-17 Homologous Polypeptides and Hierapeutic Uses Heieof 

5. "Method of Diagnosing and Treating Cartilaginous Disorders." 
MEMBERSHIPS AND ACTIVITIES 

Editor Frontiers in Bioscience 

Member DNAX Safety Committee 1991-1999 

Biological Safety Affairs Forum (BSAF) 1990-1991 
Environmental Law Foundation (ELF) 1990-1991 
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DECLARATION OF J. CHRISTOPHER GRIMALDL UNDER 37 C.F.R. § 1.132 

Commissioner for Patents 
P.O. Box 1450 
Alexandria, VA 22313-1450 



I, J. Christopher Grimaldi, declare and say as follows: 

1, I am a Senior Research Associate in the Molecular Biology Department of 
Genentech, Inc., South San Francisco, CA 94080. 

2, I joined Genentech in January of 1999. From 1999 to 2003, 1 directed the Cloning 
Laboratory in the Molecular Biology Department. During this time I directed or performed 
numerous molecular biology techniques including qualitative Polymerase Chain Reaction (PCR) 
analyses. I am currently involved in, among other projects, the isolation of genes coding for 
membrane associated proteins which can be used as targets for antibody therapeutics against 
cancer. In connection with the above-identified patent application, I personally performed or 
directed the semi-quantitative PCR analyses in the assay entitled "Tumor Versus Normal 
Differential Tissue Expression Distribution" which is described in EXAMPLE 18 in the 
specification that were used to identify differences in gene expression between tumor tissue and 
their normal counterparts. 

3, My scientific Curriculum Vitae, including my list of publications, is attached to 
and forms part of this Declaration (Bchibit A). 

4, In differential gene expression studies, one looks for genes whose expression levels 
differ significantly under different conditions, for example, in normal versus diseased tissue. 
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Chromosomal aberrations, such as gene amplification, and chromosomal translocations aie 
important markers of specific types of cancer and lead to the aberrant expression of specific 
genes and their encoded polypeptides, including over-expression and under-expression. For 
example, gene amplification is a process in which specific regions of a chromosome are 
duplicated, t^us creating multiple copies of certain genes that normally exist as a single copy. 
Gene under-expression can occur when a gene is not transcribed into nciRNA. In addition, 
chromosomal translocations occur when two different chromosomes break and are rejoined to 
each other chromosome resulting in a chimeric chromosome which displays a different expression 
pattem relative to the parent chromosomes. Amplification of certain genes such as Her2/Neu 
[Singleton et al, Pathni Ann^y^ 27Ptl :165-190], or chromoson^ translocations such as t(5;14), 
[Grimaldigffl/.,Bloo473(8):208>2085(1989); Meeker 6f a/.. Blood 76(2):285.289(1990)] give 
cancer cells a growth or sxirvival advantage relative to normal cells, and might also provide a 
mechanism of tumor cell resistance to chemotherapy or radiotherapy. When the chromoisomal 
aberration results in ttie aberrant expression of a mRNA and the corresponding gene product (the 
polypeptide), as it does in the aforementioned cases, the gene product is a promising target for 
cancer therapy, for example, by the therapeutic antibody approach. 

5. Comparison of gene expression levels in normal versus diseased tissue has 
hnportant implications both diagnostically and therapeutically. For example, those who work in 
this field are well aware that in the vast majority of cases, when a gene is over-expressed, as 
evidenced by an increased production of mRNA, the gene product or polypeptide will also be , 
over-expressed. It is xmlikely that one identifies increased mRNA expression without associated 
increased protein expression. This same principle applies to gene uiider-expression. When a 
gene is under-expressed, the gene product is also likely to be under-expressed. Stated in another 
way, two cell samples which have diflfering mRNA concentrations for a specific gene are 
expected to have correspondingly different concentration of protein for that gene. Techniques 
used to detect mRNA, such as Northern Blotting, Differential Display, in situ hybridization, 
quantitative PGR, Taqman, and more recentiy Microarray technology all rely on the dogma that a 
change in mRNA will represent a similar change in protein. If this dogma did not hold true then 
these techniques would have little value and not be so widely used. The use of mRNA 
quantitation techniques have identified a seemingly endless number of genes which are 
differentially expressed in various tissues and these genes have subsequenfly been shown to have 
correspondingly similar changes m their protein levels. Thus, flie detection of increased mRNA 
expression is expected to result in increased polypeptide expression, and the detection of 
decreased mRNA expression is expected to result in decreased polypeptide expression. The 
detection of increased or decreased polypeptide expression can be used for cancer diagnosis and 
treatment. 

6. However, even in the rare case where the protein expression does not correlate 
with the mRNA expression, this still provides significant information useful for cancer diagnosis 
and treatment For example, if over- or under-expression of a gene product does not correlate 
with over- or under-expression of mRNA in certain tumor types but does so in others, then 
identification of both gene expression and protein expression enables more accurate tumor 
classification and hence better determination of suitable therapy. In addition, absence of over- or 
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under-exprcssion of the gene product in the presence of a particular over- or und^-e^qpression of 
mRNA is crucial information for the practicing clinician. For example, if a gene is over-expressed 
but the corresponding gene product is not signijficantly over-expressed, the clinician accordingly 
will decide not to treat a patient with agents that target that gene product 

7. I hereby declare that all statements made herein of my own knowledge are true and 
that all statements made on information or belief are believed to be true, and further that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code and that such willful statements may jeopardize the validity of the application or any 
patent issued thereon. 
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(415) 681-1639 (Home) 



EDUCATION 



University of Califomia, B^keley 
Bachelor of Arts in Molecular Biology, 1984 



EMPLOYMENT EXPERIENCE 



SRA 



Genentech Inc., South San Francisco; 1/99 to present 



Previously, was responsible to direct and manage the Cloning Lab. Surrently focused on. 
isolating cancer specific genes for the Tumor Antigen (TAP), and Secreted Ttmor Protein 
(STOP) projects for the Oncology Department as well as Immunologically relevant genes for the 
Immunology Department. Directed a lab of 6 scientists focused on a company-wide team effort 
to identify and isolate secreted proteins for potential therapeutic use (SPDI). For the SPDI projec 
my duties were, among other thmgs, the critically unportant coordmation of the cloning of 
thousands of putative genes, by developmg a smooth process of communication between the 
Bioinformatics, Cloning. Sequencing, and Legal teams. Collaborated with several groups to 
discover novel genes through the Curagen project, a unique differential display methodology. 
Interacted extensively with the Legal team providing essential data needed for filing patents on 
novel genes discovered through the SPDI, TAP and Curagen projects. My group has developed, 
implemented and patented high throughput cloning methodologies that have proven to be 
essential for the isolation of hundreds of novel genes for the SPDI, TAP and Curagen projects as 
well as dozens of other smaller projects. 



Scientist DNAX Research Institute, Palo Alto; 9/91 to 1/99 

Involved in multiple projects aimed at understanding novel genes discovered through 
bioinformatics studies and functional assays. Developed and patented a method for flie specific 
depletion of eosinophils in vivo using monoclonal antibodies. Developed and implemented 
essential technical methodologies and provided strategic direction in the areas of expression, 
cloning, protein purification, general molecular biology, and monoclonal antibody production. 
Trained and supervised numerous technical staff. 



Directed plant-related activities, which included expansion planning, maintenance, safety, 
purchasing, inventory control, shipping and receiving, and laboratory management. Designed 
and implemented the safety program. Also served as liaison to regulatory agencies at tiie local, 
state and federal level. Was in charge of property leases, leasehold improvements, etc. 
Negotiated vendor contracts and directed tiie purchasing department. Trained and supervised 
personnel to carry out the above-mentioned duties. 
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Manager 



Corixa, Redwood City; 5/89 - 7/91. 
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SRA University of California, San Francisco 

Cancer Research Institute; 2/87-4/89. 

Was responsible for numerous cloning projects including: studies of somatic hypermutation, 
studies of AIDS-associated lyD[q>homas, and cloning of t(5;14), t(ll;14), and t(8;14) 
translocations. Focused on the activation of hemopoietic growth factors involved in the t(5;14) 
translocation in leukemia patients.. 



Research 

Technician Berlex Biosciences, South San Francisco; 7/85-2/87. 

Worked on a subunit porcme vaccine directed against Mycoplasma hyopneumoniae. Was 
responsible for generating genomic libraries, screening with degenerate oligonucleotides, and 
characterizing and expressing clones in E. coli. Also constructed a general purpose expression 
vector for use by other scientific teams. 
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The t(5;14) Chromosomal Translocation in a Case of Acate Lymphocytic 
Leukemia Joins the Interleukin-3 Gene to the Immunoglobulin Heavy Chain Gene 

By J. Christopher Grimaldt and Timothy C. Meeker 



Chromosomal translocations have proven to foe important 
markers cf the genetic abnormalities central to the patho- 
genesis of cancer. By cloning chromosomal breakpoints 
one oan identify activated proto-oncogenes. We have stud- 
ied a ease of B-lIneage acute lymphocytic leukemia (ALL) 
that was associated with peripheral blood eoslnophilla. The 
chromosomal translocation t(6;14l (q31;c|32) from this 
sample was cloned and studied at the molecular teveL This 



translocation joined the immunoglobulin heavy chain join- 
ing (Jh) region to the promoter region of the interleukin-3 
(IL>3) gene in opposite transcriptional orientations. The 
data suggest that acthratlon of the IL-3 gene by the 
enhancer of the immunoglobulin heavy chain gene may play 
a central role In the pathogenesis of this leukemia and the 
associated eoslnophilia. 
ft 19S9 by Grune & Stratton, Inc. 



KARYOTYPIC STUDIES of leukemia and lymphoma 
have identified frequent nonrandom chromosomal 
translocations. Some of these translocations juxtapose the 
immunoglobulin heavy chain (IgH) gene with important 
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Fig 1. DNA btots of the leukemia sample. The restriction 
fragment pattern of normal human DNA (N) and the leukemia 
sample (L) were compared using a human Jh probe« Rearranged 
bands are indicated by arrows. Sample L exhibits a single rear*- 
ranged band with both Hfmf 111/ fcoRl and SaU3A restriction 
digests. The rearranged bands are less Intense than the other 
bands because the majority of cells in the sample represent normal 
bone marrow elements. 



protooncogenes, such as c-myc and bcl-l}^ In this way, the 
IgH gene can activate protooncogenes, resulting in disor- 
dered gene expression and a step in the development of 
cancer. The investigation of additional nonrandom transloca> 
tions into the IgH locus allows us to identify new genes 
promoting the generation of leukemia and lymphoma. 

A distinct subtype of acute lymphocytic leukemia (ALL) 
has been characterized by B-!ineagc phenotype, associated 
eosinophilia in the peripheral blood, and a t(5;14)(q31;q32) 
chromosomal translocation.^*^ This syndrome probably 
occurs in <l% of all patients with ALL. We hypothesized 
that the cloning of the translocation characteristic of this 
leukemia might allow the identification of an important gene 
on chromosome 5 that plays a role in the evolution of this 
disease. In this report we demonstrate that the interleukin-3 
gene (IL-3) and the IgH gene arc joined by this transloca- 
tion. 

MATERIALS AND METHODS 

Sample and DNA blots^ A bone marrow aspirate from a repre- 
sentative patient with ALL (LI morphology by French-American- 
Britisb [FABl criteria), peripheral eosinophilia (up to 20,000 per 
microliter with a normal value of <350 per microliter) and a 
t(5;14)(q31;q32) translocation was studied. Using published meth- 
ods, genomic DNA was isolated and DNA blots were made.' Briefly, 
10 ftg of high molecular weight (mol wt) DNA were digested using 
an appropriate restriction enzyme and electrophoresed on a 0.8% 
agarose gel. The gel was stained with ethidium bromide, photo- 
graphed, denatured, neutralized, and transferred to Hybond (Amer- 
sham, Arlington Heights, IL). After treatment of the filter with 
ultraviolet tight, hybridization was performed. The filter was washed 
to a final stringency of 0.2% saturated sodium citrate (SSC) and 
0.1% sodium lauryly sulfate (SDS) and exposed to film. The human 
Jh probe has been previously reported.* 

Genomic library. The genomic library was made using pub- 
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lished methods.^ Approximately 100 tig of high mol wt genomic 
DNA were parUaUy d^ested with the Sau3A restriction enzyme. 
Fragments from 9 to 23 fcilobascs (kb) in size wci€ isolated on a 
sucrose gradient and ligated into phage EMBL3A (Strategene, San 
Diego). Recombinant phage were packaged^ plated, and screened as 
previously reported.* 

DNA sequencing. Fragments for sequencing were cloned into 
MI3 vectors and sequenced by the chain ternunation method using 
Sequenase (United States Biochemical, aeveland)/ All sequence 
data were derived from both strands. 

RESULTS 

Wc studied a bone marrow sample from a patient with 
ALL and associated peripheral eosinophilia. Karyotypic 
analysis showed the characterisUc t(5;14)(q31;q32) translo- 
cation. These features define a distinctive subtype of ALL.'** 
The leukemic cells were analyzed for cell surface phenotype 
by immunofluorecence. They were pc^itive for Bl (CD20) 
B4 (CD19). cALLA' (CDIO). HLA-DR. and temSiai 
deoxynucleotidyl transferase (Tdt). but negative for surface 
immunoglobulin. This phenotypic profile describes an imma- 
ture ceil from the B-lymphocytic lineage.* 

The leukemia DNA was analyzed by Southern blotting for 
rearrangements of the IgH gene. Using a human immuno- 
globulin Jh probe, a single rearranged band was detected by 
EcoRh Hindin, Sstl, Sau3A, and EcoRl plus HindlU 
resUiction digests, suggesting rearrangement of one allele 
(Fig I). The immunoglobulin Jh region from the other allele 
was presumably cither deleted or in the germlinc configura- 
tion. 

We hypothesized that the t(5;14)(q31;q32) juxtaposed a 
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growth-promoting gene on chromosome 5 with the immuno- 
globulin Jh region on chromosome 14. Therefore, a genomic 
library was made from the leukemic sample and screened 
with a Jh probe. Fifteen distinct positive clones were isolated 
and screened for the presence of the rearranged Sau3A 
fragment that was detected by DNA blotting. By this 
analysis, five clones appeared to represent the rearranged 
allele identified by DNA blots. One of these clones (clone no. 
4) was chosen for further study and a detaUed restriction 
map was generated. The £coRI. HindlU/ EcoKl, and Sstl 
fragments from clone no. 4 that hybridized to the human Jh 
probe were also identical in size to the rearranged fragments 
from the leukemia sample, confirming that clone no. 4 
represented the rearrangod leukemic allele. 

Phage clone no. 4 contained 3.7 kb of unknown origin 
joined to the IgH gene in the region of Jh4 (Fig 2). The IgH 
gene from Jh4 to the Cmu region appeared to be in germline 
configuration. Previously.- the gene encoding hematopoietic 
growth factor IL-3 had been mapped to chromosome 5q3 1 so 
it was suspected that clone no. 4 might contain part of this 
gene.'-" When the restriction map of human IL-3 and clone 
no. 4 were compared, they were identical for more than 3 kb 
(Fig 2). 

We confirmed the juxtaposition of the IL-3 gene and the 
IgH gene by nucleic acid sequencing of the subcloned 
BstEll/Hpal fragment (Fig 2). The sequence of this frag- 
ment showed no disruption of the protein coding region or the 
messenger RN A of the IL-3 gene. The break in the IL-3 gene 
occurred in the promoter region, 452 base paire (bp) 
upstream of the transcriptional start site (position 64, Fig 
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3A). The break in the IgH gene occurred 2 bp upstream of GM-CSF maps vnthin 9 kb of IL-3 In the same tram«np. 

the Jh4 region. Between the two breaks. 25 bp of uncertain tional orientaUon.«« Usrag this information and assuming a 

Orion (putativeN sequence) were inserted." " No sequences simple translocation event in our sample, we can conclude 

homologous to the immunoglobulin heptamer and nonamer that the IL-3 gene is normally more centromeric. and Ac 

could beidentifiedintheIL.3sequence(Fig3B). Therefore. GM-CSF gene more telomeric on '^'^^^f /jlf f'^^^^^ 

nucleic add sequencing confirmed the juxtaposition of the Furthermore, both are transcribed with their 5 ends toward 

lL-3 gene and the IgH gjaie. The sequence data clearly the centromere, 

showed that the genes were positioned in opposite transcrip- DISCUSSION 

tional orientations (head-to-head). ... • t.. „««..i 

Available data also aUowed us to determine the normal In this report we have cloned a unique chroinosomal 

Dosttions of the IL-3 gene and the GM-CSF gene in relation translocation that appears to be a consistent fwiture of a rare, 

jotfiecentromereof chromosome 5 (Fig 4). The IgH gene is yet distinct, clinical form of acute leukemia This transtoca- 

known to be positioned with the variable regions toward the tion joined the promoter of the IL-3 gene to the IgH gene, 

tdomere on <^osome I4q.»-" It has also been shown that Except for the altered promotor. the IL-3 gene appeared 

5 • TACCAaCAAACTCTCATCTGTTCCAGTGGCCTCCTGGCCACCCACCAGGACCAAGCAGGGC^ 160 
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intact as no deletions, insertions, or point mutations were 
detected by restriction mapping of the entire gene and 
sequencing of part of the gene. The IgH gene has been 
truncated at the Jh4 region, which places the immunoglobu- 
lin enhancer within 2.5 kb of the IL-3 gene."-" This leads to 
the hypothesis that the enhancer is increasing transcription 
of a structurally normal IL-3 gene. The same mechanism is 
important for activation of the i>myc gene in some cases of 
Burkitt's lymphoma." An alternate hypothesis is that the 
elimination of an upstream IL-3 promoter dement is crucial 
to the activation of the IL-3 gene. 

The proposed activation of the IL-3 gene suggests that an 
autocrine loop is important for the pathogmesis of this 
leukemia. Over-expression of the 11^3 gene coupled with 



the presence of the IL-3 receptor in these cells could account 
for a strong stimulus for proliferation. In this regard* there 
are daU indicating tiiat immature B-lineage lymphocytes 
and B-Uneage leukemias may express the IL-3 receptor.^'^ 

An additional feature of this type of leukemia is the 
dramatic eosinophiHa, consisting of mature forms. It has 
been hypothesized that the eosinophils do not arise from the 
malignant clone, but are stimulated by the tumor."*^ 
Because of the known effect of IL-3 on eosinophil differentia- 
tion, secretion of high levels of 11^3 by leukemic cells might 
have a role in the eosinophilia in this type of leukemia. " 

The data suggest that the recombination mechanism that 
is active in the IgH gene during normal differentiation has a 
role in this translocation."'** This is supported by the break- 
point location at the 5' end of Jh4 and the presence of 
putative N-rcgion sequences. On tiie othw hand, no recombi- 
nation signal sequence (heptamer and nonamer) was found 
in this r^on on chromosome 5, suggesting that additional 
factors also played a role. Further studies will eluddate the 
mechanism of this and other translocations.. 

In the leukemia we studied, it is possible that the immuno- 
globulin enhancer also activates the GM-CSF gene» since 
this gene is probably positioned only 1 4 kb away (Fig 4). This 
is known to be within the range of enhancer activation." The 
interleukin-5 (IL-5) gene maps to chromosome 5q31.^ 
Deregulation of the IL-5 gene by this translocation would act 
syner^stically with IL-3 in the stimulation of eosinophil 
proUferation and differentiation." These and other questions 
will be answered by the study of more patient samples; We 
plan to determine whether the t(5;14)(q31;q32) transloca- 
tion is capable of activating multiple lymphokines simulta- 
neously and whether tiiey cooperate in the generation of this 
leukemia. 
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RAPID COMMUNICATION 

Activation of the Interleukiii*3 Gene by Chromosome Translocation in Acute 
Lymphocytic Leukemia With Eosinophilia 

By Timothy C. Meeker. Dan Hardy, Cheryl Willman, Thomas Hogan, and John Abrams 



The t(5;14)(q31;q32) translocation from B-llneage acute 
lymphocytic leukemia with eosinophilia has been cloned 
from two leuicemla saniptes. In both cases* this transloca- 
tion Joined the I9H gene and the interleukin-3 (IL-3) gene. In 
one patient, excess It^ mRNA was produced by the 
leukemic cells. In the second patient, serum IL-3 levels 
were measured and shown to correlate with disease 

A NUMBER OF chromosome translocations have been 
associated with human leukemia and lymphoma. In 
many cases the study of these translocations has led to the 
discovery or characterization of proto^ncogenes^ such as 
bcI-2, Crobl, and c-myc, that arc located adjacent to the 
translocation.'-^ It is now widely understood that cancer- 
associated translocations disrupt nearby proto-oncogenes. 

A distinct subtype of acute leukemia is characterized by 
the triad of B-lineage immunophenotype, eosinophilia, and 
the t(5;l4)(q31;q32) translocation.^*^ Leukemic cells from 
such patients have been positive for terminal deoxynucleotidyl 
transferase (Tdt), common acute lymphoblastic leukemia 
antigen (CALLA), and GDI 9, but negative for surface or 
cytoplasmic inununoglobulin. In previous work, we cloned 
the t(5;14) breakpoint from one leukenuc sample (Case 1) 
and determined that the IgH and interleukin-3 (IL-3) genes 
were joined by this abnormality.* In this report, we extend 
those findings by showing that the t(5;14)(q3l;q32) translo- 
cation from a second leukemia sample (C^e 2) has a sinular 
structure, and we report our study of growth factor expres- 
sion in these patients, 

MATERIALS AND METHODS 

Samples and Southern blots. Case 1 has been described.^** 
Glimcfel features of Case 2 have been described m detail.' DNA 
isolation and Southern blotting was done using previously described 
methods.' Filters were hybridized with an immunoglobulin Jb probe, 
a 280 bp Bamm/EcoHl genomic IL-3 fragment, and an IL-3 
cDNApiobe.^-* 

Northern blots, RNA isolation and Northern blotting have been 
described.' Briefly, Northern blots were done by separating 9ftg 
total RNA on 1% agarose-formaldehyde gels. Equal RNA loading in 
each lane was confirmed by ethidium bromide staining* Blots were 
hybridized with an IL-3 cDNA probe extending to the Xho I site in 
cxon 5. a 720 bp Ssi l/Kpn I probe derived from intron 2 of the IL-3 
gene, a 600 bp Nhe i/Hpa I lL-5 cDNA probe, and a 500 bp Pst 
l/Nco I granulocyte-macrophage colony stimulating factor (GM- 
CSF)cDNAprobe."»^>» 

Polymerase chain reaction. Primers were designed with BamHl 
sites for cloning. One primer hybridized to the Jh sequences from the 
IgH gene (Primer 144:5'-TAGOATCCGACGGTGACCAGGGT), 
and the other hybridized to the re^on of the TATA box in the 11^3 
gene(Prinier 161: S'-AACAOGATCCOOCCTTATATOTOCAG). 
Myvmac cham reaction (PCR) (95^ for 1 mbute, 61^ for 30 
seconds, and 72^ for 3 nunutes) was done using 500 ng genomic 
DNA and SO pmel of each primer in 100 fxL containing 67 nunol/L 
Tris-HO pH 8.8, 6.7 mmol/L MgQ^, 10% dimethyl sulfoxide 
(DMSO), 170 Mg/mL bovine serum albumin (BSA) (fraction V), 



acth/lty. There was no evidence of excess granulocyte/ 
macrophage colony stimulating factor (GM-CSF) or IL-5 
expression. Our data support the formulation that this 
subtype of leukemia may arise in part because of a 
chromosome translocation that activates the IL-3 gene, 
resulting in autocrine and paracrine growth effects. 
® 1990 by The American Society of Hematology, 

16.6 mmol/L anunonium sulfate, 1 .5 mmol/L eadi dNTP and Taq 
polymerase (Po'kui-Ehner, Norwalk, CT)." 

Sequencing, Sequencing was done by chain termination m Ml 3 
vectors." As part of this study, we sequenced a subclone of a normal 
IL-3 promoter, covering 598 basi^pairs from a Sma I site at position 
— 1240 (with respect to the proposed site of transcription initiation) 
to an Nhe I site at position -642. The plasmid containing this region 
was a gift from Naoko Arai of the DNAX Research Institute. 

Expression in Cos? cells. A genomic IL-3 fragment from Case 1 
was cloned into the pXM expression vector.'^ Briefly, the Hindlll/ 
Sal I fragment containing the IL-3 gene was subcloncd from the 
previously described phage clone 4 into pUC18.^ The 2.6 kb 
fragment extending from the Sma I site 61 bp upstream of the 11^3 
transcription start to the Snui I site in the polylmker was cloned into 
the blunted Xho I site of pXM. The negative control construct was 
the pXM vector without insert Plasmids were inUoduced into Cos7 
cells by electroporation, and supernatant was collected after 48 
hours in culture. 

TFl bioassay, TF-1 cdls were passaged in RPMI 1640 supple- 
mented with 10% heat-inactivated fetal bovme serum, 2 mmol 
L-glutamine, and I ng/mL human GM-CSF." Samples and antibod- 
ies were diluted in this same medium lacking GM-CSF but contain- 
ing penicillin and strq>U)mydn, A 25 ^L volume of serial dilutions of 
patient serum was added to wells in a flat bottom 96-we!l microliter 
plate. Rat anti-cytokine monoclonal antibody in a volume of 25 itL 
was added to appropriate wells and preincubated for I hour at 37'C. 
Fifty microliters of twice washed TF-I cells were added to each well, 
giving a final cell concentration of 1 x 10^ cells per well (final 
ydume. 100 ^L). The plate was incubated for 48 houis. The 
remaining cell vxabUity was determmed metabdiicalty by the colori- 
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Breakpomt sequences for Case 2. The Qermline IgJhS region sequence (protein codina rAntnn .^^h ui ^ 
8oquencesareunderlined)isontap.thetranslo€ailoftsequencef^mC^ 

is fn the middle, and the germllne sequence, which we derived from e normal IL^ cTono. fs on S« ? l^J^ w ""f^*-""^' 

sequBiH^ has the same ni-cleotlde. TT,e sequence documents the head-t<Miead Joining of the 

gene occurred at position -934 (*). i no oreaicpoint (n the IL-3 
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metric method of Mosmann using a VMax microtiter plate reader 
(Molecular Devices, Menlo Park, CA) set at 570 and 650 nm.'* 

Cytokine immunoassays. These assays used lat monoclonal 
anti-cytokine anUWics (10 Mg/mL) to coat the wells of a PVC 
microtiter plate. The capture antibodies used were BVD3-6G8. 
JES1.39D10, and BVD2-23B6. for the BUS, 11^5, and GM-CSF^ 
assays, respectively. Patient sera were then added (undflutcd and 
diluted 1:2 for 11^3, undiluted for 11^5, and undiluted and diluted 
1:5 for GM-CSF). The detecting immimoreagents used were either 
mouse antiserum to IL-3 or nitroiodophenyl (NIP)-d«ivatizcd rat 
monoclonal anUbodies JESU5A2 and BVD2-21C11. specific for 
n.-5 and GM-CSF. respectively. Bound antibody was subsequently 
detected with immunoperoxidase conjugates: horseradish peroxidase 
(HRP)-labeled goat anti-mouse Ig for IL-3, or HRP-labeled rat (J4 
MoAb) anti-NIP for IL-5 and GM-CSF, The rfiromogenic sub- 
strate was 3-3'azino-bis-bcnzthiazoline sulfonate (ABTS; Sigma, St 
Louis, MO). Unknown values were interpolated from standard 
curves prepared from dilutions of the recombinant factors using 
Softmax software available with the VMAX microplate reader 
(Molecular Devices). 

RESULTS 

Leukemic DNA from Case 2 was studied by Southern 
blotUng. When digested with the jfiTrndlll restriction enzyme 
and hybridized with a human immunoglobulin heavy chain 
joining region (Jh) probe, a rearranged fragment at approxi- 
mately 14 kb was detected (data not shown). Wherrreprobed 
with either of two different IL-3 probes,.a rearranged 14 kb 
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^A^ri' ^^e^^ns ^'^ <k« rearranged Jh fragment, was 
identified. Whea leakemic DNA was digested with HindUl 
plus EcoKl, a rearranged Jh fragment was detected at 6 kb 
The IL.3 probes alsp. identified a comigrating fragment of 
-this size. These experiments indicated that the leukiemic 
sample studied was clonal and that a single fragment 
conUmed both Jh and IL-3 sequences, suggesting a translo- 
cation had occurred. 

To characterize better the joining of the IL-3 gene and the 
immunoglobulin heavy chain (IgH) gene, the polymerase 
Cham reaction (PCR) was used to clone the translocation." 
A Jh pruner and an IL-3 primer wwe designed to produce an 
amplified product in the event of a head-to-head transloca- 
tion. While control DNAgaveno PCR product. Case2 DNA 
yielded a PCR-deiived fragment of approximately 980 bp 
which was doned and sequMiced. 

The DNA sequence of the translocation done from Case 2 
confirmed the joining of the Jh region with the promoter of 
the lLr3 gene u a head-to-head configuraUon (Fig 1) 
Sequence analysis indicated that the breakpoint on chromo-' 
some 14 was just upstream of the Jh5 coding region. Tbo 
breakpomt on chromosome 5 occurred 934 bp upstream of 
the puutivc site of transcripUon initiation of the IL-3 gene 
We also determined that a putative N sequence of 1 7 bp was 
inserted between the chromosome 5 and chromosome 14 
sequences durmg die translocation event." " Figure 2 shows 
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Fig 2. Ralationship of ehromoaotne E breakpoints to tha IL-3 gend. This iigtira shows tha two doned brMknafnta lm«u^\ u. 

the normal IL-a Qono."" One breakpoint occurred at position -482 and the other at -934 terrowri ta b^Tti^^^L^''^' 
transtocMlon. recited in a head-to-head Joining of the IgH gene and the 11-3 gene, leaving tha mRNA an^ proteta codU.«SZ^^jfJ 
gene Intact. Boxea denote the Are VL-a axons: rMrietion enzyme* are iB) ShiiHI. «P) Pstf.WHpal (EJ f ooW wdOO Sto I 
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. Rg 3. Documentation of IL-3 mRNA over-expression. A Northern blot was prepared and hybridized with a probe for iL-3. Una 1 
contained RNA from unatfmuiatad per^heraf bfood lymphocytes (PBL) as a negative control, lane 2 contained RNA from PBL stimulated for 
4 hours with concanavalian A (ConAL and lane 3 contained RNA from PBL stimulated wKh ConA for 48 hours. As in the positive control 
lanes (2 and 3}, a 1 kb band was Identified in the leukemic sample from Case 1 {lane 4, lower arrow), suggesting aberrant expression of the 
IL-3 gene. In addition, the leukemic sample showed over-expression of an unspUced 2.9 kb IL-3 transcript (lane 4* upper arrow)« We 
di^umenteil that this represented an unsplioed precursor of the mature 1 kb transcript fay showing that this band hybridised to a ))roba 
fi-om faitron 2 of the IL-3 gene. A similar 2.9 kb band was detectM tn-lana-a. suggesting that an IL-3 mRNA of this sbe Is sometimes 
detectable In normal mitogen-stlmulated cells. Lane Bthrough 10 represent RNA from six samples of B-Knaage acuta lymphoeytte letdcemla 
without the t(6;14) trandocatian. Indicating that only the sample with the translocation exhibited IL-^ over-expression. Case 2 could not be 
analyzed by Northern btot because too few calls were available for study. 



the locations of the two cloned breakpoints in relation to the 
ILr-3 gene. The two chromosome 5 breakpoints were sepa- 
rated by less than 500 bp. 

The genomic structure in Cases 1 and 2 suggested that a 
normal IL-3 gene product was over-expressed as a result of 
the altered promotor structure. This would predict that the 
IL-3 gene on the translocated chromosome was capable of 
makmg IL-3 protein. This prediction was tested by express- 
ing a genomic fragment from the translocated allele of Case 
1 containing all five IL-3 exons under the control of the SV40 
promotor/enhancer in the Cos? cell Ime. Cell supematants 
were studied in a proliferation assay using the factor depen- 
dent erythroleukemic cell line, TF-1. The supematants* 
derived from transfections using the vector plus insert 
supported TF-l proliferation, while supernatants from trans- 
fections using the vector alone were negative in this assay 
(dallrftot shown). Furthermore^ the biol(^c activity could be 
blocked by an antibody to human 11^3 (BVD3-6G8). This 
result showed that the translocated allele retained the ability 
to make IL-3 mRNA and protein. 

The level of expression of lL-3 mRNA in leukemic cells 
from Case 1 was assessed. Northern blotting showed that the 
mature IL-3 mRNA (approximately 1 kb) and a 2.9 kb 
unspliced 11^3 mRNA were excessivdy produced by the 
leukenua (Fig 3). The 2.9 kb form of the mRNA is also 
preswit at low levels in normal peripheral blood T lympho- 
cytes after mitogen activation (Fig 3). Several B-Iineage 
acute leukemia samples without the t(5;14) translocation 
had undetectable levels of IL-3 mRNA in these experiments. 
In addition, although genes for GM-CSF and IL-5 map close 
to the IL-3 gene and might have been deregulated by the 
translocation* no 11^5 or GM-CSF mRNA could be detected 
in the leukemic sample (data not shown)."*^ 

Three serum samples from Case 2 were assayed by 
immunoassay for levels of GM-CSF, and IL-5 (Table 
1). Serum IL-3 could be detected and correlated with the 
clinical course. When the patients leukemic cell burden was 



highest, the IL-3 levd was highest. No serum GM-CSF or 
IL-5 could be detected. 

Since the 11^3 immunoassay measured only immunoreac- 
tive factor, we confimed that biologically active IL-3 was 
present by usmg the TF-1 bioassay. This bioassay can be 
rendered monospecific using appropriate neutralizing mono- 
clonal antibodies specific for IL-3, IL^5, or GM-CSF. We 
observed that sera from 1-16-84 and 3-14-84 contained TF-1 
stimulating activity that could be blocked with anti-lL-3 
MoAb (BVD3-6G8), but not with MoAbs to IL-5 (JESI- 
39D10) or GM-CSF (BVD2-23B6) (Fig 4; GM-CSF data 
not shown). The amount of neutralizable bioactivlty in these 
two samples correlated very well with the difference in IL-3 
levels obtained by immunoassay for these samples. Flirther- 
more, the failure to block TF-1 proliferating activity with 
eithe^.5Lnti-IL-5 or anti-GM-CSF was consistent with-thp;, 
inability to measiue these factors by immunoassay .and 



Table 1 . Peripheral Blood Counts and Growth Factor Lavals 
at Different Times In Case 2 

Sample Date 





11/16/83 


t/16/84 


3/14/84 


Perlpharal blood counts (c8IIs/mL) 








WBC 


81,800 


116,600 


12,300 


Lymphoblasts 


0 


33,785 


0 


Eosinophils 


46,626 


73,080 


615 


Serum growth factor levels (pg/mU) 








tL-3 


<444 


7,998 


1,061 


GM-CSF 


<15 


<15 


<15 


IL-5 


<50 


<50 


<S0 



Peripheral Mood counts from Case 2 at three tfifferent time points with 
the corresponding growth factor levels quantified by immunoassay. The 
patient recehred chemotherapy between 1/16/84 and 3/1 4/84 to lower 
his leukemic burden.' No serum samplea ware avaitable Idr a similar 
analysis of Case 1. 

/Uibraviaglon: WBC, white blood cells. 
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Fig 4. Bioassay of serum Lmikwnle patient sera were tasted for Mooctive tL-a an4 0.-9 in tli« TF-1 Drolifar«]<.n -ru 
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vertlcal«cl8.Se™m from ell tlireetlmepolnt. was a,«iyedsimim,neously.Th^^ 



indicated that tbese other myeloid growth factors were not 
detectably circulating in the serum of this patient 

DISCUSSION 

In this report, we have extended our analysis of acute 
lymphocytic leukemia and eosinophilia associated with the 
t(5; 14) translocation. In both cases we have studied, we have 
documented the joining of the IL-3 gene from chromosome 5 
to the IgH gene from chromosome 14. The breakpoints on 
chromosome 5 are withm 500 bp of each other, suggesting 
that additional breakpoints will be clustered in a small region 
of the IL-3 promotor. The PCR assay we have developed will 
be useful in the screening of additional clmical samples for 
this abnormality. 

The finding of a disrupted 11^3 promotor assodated with 
an otherwise normal IL-3 gene Implied that this transloca- 
tion might lead to the over-expression of a normal IL-3 gene 
product. In this work, we have documented that this is tpie. 
In addition, neither GM-CSF nor IL-5 are over-expressed by 
the leukemic cells. Furthermore, in one patient, serum IL-3 
could be measured and correlated with disease activity. To 
our knowledge, this is the first measurement of human IL-3 
in serum and its association with a disease process. The 
measurement of serum IL-3 in this and other clinical settings 
may now be indicated. 



The finding of the IL-3 gene adjacent to a cancer- 
associated translocation breakpoint suggests that its activa- 
tion is important for oncogenesis. It is our thesis that an 
autocrine loop for IL-3 is important for the evolution of this 
leukemia.^' The excessive IL-3 production that we have 
documented would be one feature of such an autocrine loop. 
The final proof of our thesis must await additional data. In 
particular, from the study of additional clinical samples, it 
will be necessary to document that the IL-3 receptor is 
present on the leukemic cells and that anti-IL-3 antibody 
decreases proliferation of the leukemia in vitro. 

An important aspect of this work is the suggestion of a 
therapeutic approach for this disease. If an autocrine loop for 
IIy3 can be documented in this disease, attempts to lower 
drculating IL-3 levels or block the interaction of 11^3 with 
its receptor may prove useful. Because it is also possible that 
the eosinophilia in these patients is mediated by the para- 
crine ^ffects of leukemia-derived IL=^3, siixiilar interventions 
may improve this aspect of the dis^e. Antibodies or 
engineered ligands to accomplish these goals may soon be 
available. 
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Clinical and Pathologic Significance of the 
c-erbB-2 {HER-2/neu) Oncogene 

Timothy P. Singleton and Jolin G. Strickler 



^ Hie CrerhB-2 oncogene was first shown to have clinical significance in 1987 by 

! Slamon et al,'"' who reported that DUA amplification in breast carcino- 

J mas correlated with decreased survival in patients widi metastasis to axillary 

'< lymph nodes. Subsequent studies, however, of oerbB-2 activation in breast 

% carcinoma reached conflicting conclusions about its clinical significance. This 

I oncogene also has been reported to have clinical and pathologic imph'cations in 

i other neoplasms. Our review summarizes these various studies and examines 

I the clinical relevance of c-erbB-2 activation, which has not been emphasized in 

recent reviews. ^»=8,8S fl^e molecular biology of the c-erfeB-2 oncogene has been 
extensively reviewed^'*®-'® and will be discussed only briefly here. 



BACKGROUND 

!; The oncogene was discovered in the 1980s by diree lines of investiga- 

{ tion* The neu oncogene was detected as a mutated transforming gene in 

i neuroblastomas induced by etbylnitrosurea treatment of fetal rats.*-''^'^^ The c- 

^ erbB-2 was a human gene discovered by its homology to the retroviral gene v- 

I cr&B.^'*^'^ HER-2 was isolated by screening a human genomic DNA library for 

j homology with v-eriB." When the DNA sequences were determined subse- 

] quently, c*6rfeB-2, ffEil-2> and neu were found to represent the same gene. 

\] Recently, the c-eri>B-2 oncogene also has been referred to as NGL. 

^ The c-i;r&B-2 DNA is located on human chromosome I7q21^'^^ and codes 

for c-erbB-^2 mRNA (4,6 kb). which translates c^bB-2 protein (pl85). This 
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protein is a normal component of cytoplasmic membranes. 'Hie c-ariB-2 
oncogene is homologoxis with, but not identical to, c-erfcB-1, which is located 
on chromosome 7 and codes for thc cpidcrmal growth factor rcccptorA^w^Thc 
erbB-2 protein is a receptor on cell membranes and has intracellular tyrosine 
kinase activity and an extracellular binding domain.^ Electron microscopy 
with a polyclonal antibody detects c-eri?B-2 immunoreaclivity on cytoplasmic 
membranes of neoplasms, especially oh microvilli and the non-villous outer cell 
mcmbranc.^^ In normal cells, immunohistoohemical reactivity for is 
frequently present at the basolateral membrane or the cytoplasmic membrane's 
brush border.®-*'^ 

There is experimental evidence that c^r&B-2 protein may be involved in 
the pathogenesis of breast neoplasia. Overproduction of otherwise normal c- 
erbB*2 protein can transform a cell line into a malignant phenotype.^ Also, 
^cn the otC^ene containing an activating point mutation is placed in 
transgenic mice with a strong promoter for increased expression, the mice 
develop multiple independent mammary adenocarcinomas. In other experi- 
ments, monoclonal antibodies against the neu protein inhibit the growth (in 
nude mice) of a nett-transfoimed cell line,^^ and Immunization of mico with 
neu protein protects them from subsequent tumor challenge with the neu- 
transfbrmed cell Ifne.^^ Some authors have speculated that the use of antago- 
nists for the unknown ligand could be useful in future chemotherapy.^ FSirther 
review of this experimental evidence is beyond the scope of this article. 

The activation most likely occurs at an early stage of neoplastic 

development. This hypothesis is supported by the presence of c-erbB-2 activa- 
tion m both in situ and invasive breast carcinomas. In addition, studies of 
metastatic breast c*arcinomas usually demonstrate uniform c-er&B-2 activation 
at multiple sites in the same patient, althou^ c-efi&B-2 activation has 
rarely been detected in metastatic lesions but not in the primary tumor;*^'® *^ 
Even more rarely, c-erfcB-2 DNA amplification has been detected in a primary 
breast carcinoma but not in its lymph node metastasis.^ In patients who have 
bilateral breast neoplasms, both lesions have similar patterns of c-er/^B-2 activa- 
tion, but only a few sucU cases have been studied.^^ 

MECHANISMS OF o-ef6B^2 ACTIVATION 

The most common mechanism of c-er6B-2 activation is genomic DNA amplifica- 
tion, which almost always results in overproduction of c-crfeB-2 mRNA and 
protein. The c-cr^BT2 amplification may stabilize the overproduction of 

mHNA or protein througli unknown mechanisms. Human breast carcinomas 
with c-€r&B-2 amplification contain 2 to 40 times more c-er&B-2 DNA^* and 4 to 
128 times more c-erbB-l mRNA^*^ than found in normal tissue. Most human 
breast carcinomas with c-ertB-2 amplification have 2 to 15 times more c-erbB-2 
DNA. TUmors with greater amplification tend to have greater oveiproduc- 
tiOQj7,s2,65 non-mammary neoplasms that have been studied tend to have 
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similar levels of C'erbB-2 amplification or ovetproductibn relative to the corre- 
sponding normal tissue. 

The second most common mechanism of activation is overproduc- 

tion of c-erbB-2 mRNA and protein without amplification of c-erbB-2 DNA,®^ 
The quantities of mRNA and protein usually are less than those in amplified 
cases and may approach the small quantities present in normal breast or other 
tissues. Hic c-eriB-2 protein overproduction without mRNA overproduc- 
tion or DNA amplification has been described in a few human breast cardnomia 
cell lines. *^ 

Other rare mechanisms of oerbB-Z activation have been reported. Translo- 
cations involving the c-er&B-2 gene have been described in a few mammary and 
gastric carcinomas, although some reported cases may represent restriction 
firagment length polymorphisms or incomplete restriction enzyme digestions 
that mimic transrocations.^i'^''^'"*®'^^*^ A sitigle point mutation in the transmiem- 
brane portion of neu has been described in rat neuroblastoinas induced by 
ethylnitrosurea>'^ The mutated neu protein has increased tyrosine kinase activ- 
ity and aggregates at the cell membrane. Although there has been specula- 
tion that some of the amplified c-erbB-2 genes may contedn point mutations,^ 
none has been detected in primary human neoplasms.^^'"'"^ 

TECHNIQUES FOR DETECTING ACTIVATION 
Detection of c^rbB-2 DNA Amplification 

Amplification of 0'«r&fi-2 DNA is usually detected by DNA dot blot or South- 
em blot hybridization. In the dot blot method, the extracted DNA is placed 
directly on a nylon membrane and hybridized with a 06r6B-2 DNA probe. In 
the Southern blot meftod, the extracted DNA is treated with a restriction 
enzyme, and the fragments are separated by electrophoresis, transferred to a 
nylon membrane, and bybyidized with a c^rfcB-2 DNA probe. In both tech- 
niques, c-crbB-2 amplification is quantified by comparing the intensity (mea- 
sured by densitometry) of the hybridization bands from the sample with those 
fi-om ccmtrol tissue. 

Several technical problems may complicate the measurement of c-crfeB-2 
DNA amplification. First, the extracted tmnor DNA may be excessively de- 
graded or diluted by DNA firom stromal cells.»i Second, the c-erfcB-2 DNA 
probe must be carefully chosen and labeled. For example, oligonucleotide o 
0r&B-2 probes may not be sensitive enough for measuring a low level of c-erBB- 
2 amplification, because diploid copy numbers can be difiBcult to detect (unpub- 
lished data). Third, the total amounts of DNA in the sample and control tissue 
must be compensated for, often with a probe to an unaroplified gene. Many 
studies have used control probes to genes on chromosome I7» the location of c- 
er6B-2, to correct for possible alterations in chromosome number. Identical 
results, however, are obtained by using control probes to genes on other chro- 
mosomes,^'^-^ with rare exception.'*^ Studies using control probes to the beta- 
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globin gene must be interpreted with caution, because one allele of this gene is 
deleted occasionally In breast, carcinomas.^ 

Amplification of c-er&B-2 DNA was assessed by using the x>olymerase 
chain reaction (PGR) in one recent study.^^ Oligoprimers for the {>efhB-^2 gene 
and a control gene are added to the sample's DNA, and PGR Is performed. If 
the sample contains more copies of c-€rfcB-2 DNA than of the control gene, the 
c-erfcB-2 DNA is replicated preferentially 
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Detection of c-efbB-2 mRNA Overproduction 

Overproduction of mRNA usually is measured by RNA dot blot or 

Northern blot hybridization. 9oth techniques require extraction of RNA but 
otherwise are analogous to DNA dot blot and Southern blot hybridization. Use 
of TCR for detectioa of o^r£fB<2 mRNA has been described in 'two recent 
abstracts.«8»i« 

Overproduction of c-er&B*2 mRNA can be measured by in situ hybridi2a- 
tion. Sections are mounted on glass slides, treated with protease, hybridized 
with a radiolabeled probe, washed, treated with nuclease to remove unbound 
probe, and developed for autoradiography. Silver grains are seen only over 
tumor cells that overproduce c-erbB-2 mRNA- Negative control probes are 
used.^W'i^ Our experience indicates that these techniques are relatively insensi- 
tive for detecting c-6rfeB-2 mRNA overproduction in routinely processed tisr 
sue. Althotigh the sensitivity may be increased by modifications that allow 
simultaneous detection o(crerbB-2 DNA and mRNA> in situ hybridizaticm still 
is cumbersome and expensive (unpublished data). 

All of the above CrerbB'2 mRNA detection techniques have several prob- 
lems that make them more difficult to perform than techniques for detecting 
DNA amplification. One major problem is the rapid degradation of RNA in 
tissue &at is not immediately frozen or fixed. In addition, during the detection 
procedure, RNA can be degraded by RNase; a ubiquitous enzyme, which must 
be eliminated meticidously iBrom laboratory solutions. Third, control probes to 
genes that are uniformly expressed in the tissue of interest need to be carefiiUy 
selected. ' 

Detection of G^rbB-2 Protein Overproduction 

The most accurate methods for detecting c*^B*2 protein overproducdon are 
die Western blot method and knmunoprecipitation. Both techniques can docu- 
ment the binding specificity of various antibodies against c-erhB-2 protein. In 
Western blot studies, protein is extracted from the tissue, separated by electro- 
phoresis (according to size), transferred to a membrane, and detected by using an- 
tibodies to c-er&B^2. In immunoprecipitation studies, antibodies against o*eriB- 
2 are added to a tumor lysate* and the resulting protein-antibody precipitate is 
separated by gel electrophoresis and stained for protein. Both Western blot and 
immunoprecipitation are useful research tools but currently are not practical for 
diagnostic pathology IWo recent abstracts have described an enzyme-linked 
immunosorbent assay (EUSA) fi>r detection protein. 
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Overproduction of c-erfcB-2 protein is most commonly assessed by various 
Immunoblstochemical t^hniques. These procedures, often generate conflicting 
results, which are explained at least partially by ihroc factors. First, various 
studies have used different polyclonal and monoclonal antibodies. Because 
some polyclonal antibodies recognize weak bands in addition to Ihe 
protehi band on Western blot or immunoprecipitation, the results of these 
stadies should be interpreted with caut!on,*w6,^'*^ Eveii some monoclonal anti- 
bodies immunoprecipitate protein bands in addition to c-erfcB-2 (plSS).^'^^^® 
Second, tissue fixation contributes to variability between studies. For example, 
• some antibodies detect c^rbB'2 protein only in frozen tissue and do not' react 
in fixed tissue. In general, formalin fixation diminishes the sensitivity of 
Immunohistochemical methods and decreases the number of reactive cells.^^-" 
When .Bpuin^ fixative is-^i^edrther^ may be a higher percentage of positive 
cases.^ Third, minimal criteria Ibr interpreting immunohistochemical staining 
are generally lacking. Although there is general agreement that distinct crisp 
cytoplasmic membrane staining is diagnostic for c-er6B-2 activation in breast 
. carcinoma, the number of positive cells and the staining intensity required to 
diagnose c-erfcB-2 protein overproduction varies from study to study and firom 
antibody to antibody. Degradation of c-erbB-2 protein is not a problem because 
it can be detected in intact form more than 24 hours after tumor resection 
without fixation or freezing,^ 



ACTIVATION OF IN BREAST LESIONS 

Incidence of o^r6B-2 Activation 

Most studies of c-er&B-2 oncogene activation do not specify histobgfcal sub- 
types of infiltrating breast carcinoma. Amplification of c-erbB-2 DNA was found 
in 19.1 percent (519 of 2715) of invasive carcinomas in 25 studies (Ikble 1), and 
c-cr6B-2 mRNA or protein overproduction was detected in 20.9 percent (566 of 
2714) of invasive carcinomas in 20 studies. TVelve studies have documented c- 
erfaB-2 mRNA or protein overproduction in 15 percent (88 of 604) of carcinomas 
that ladced c-erbB-2 DNA amplification. 

The incidence of o-«rfeB-2 activation in infiltruting breast carcinoma varies 
with the histological subtype. Approximately 22 percent (142 of 650) of infiltrat- . 
ing ductal carcinomas have c-eriB-2 activation, as expected from the above 1 
data. Other variants of breast carcinoma with frequent c-erbB-2 activation are j 
inflammatory carcinoma (62 percent, 54 of 87), Paget's disease (82 percent, 9 of 
II). and medullary carcinoma (22 percent, 5 of 23). In contrast. c-erf>B-2 activa- { 
tion is infrequent in infiltrating lobular carcinoma (7 percent, 5 of 73) and 
tubular carcinoma (7 percent, I of 15). 

The c-er&B-2 protein overproduction is present In 44 percent (44 of 100) of 
ductal carcinomas in situ and especially comedocardnoma in situ (68 percent, 
49 of 72). The micropapillary type of ductal carcinoma in situ also tends to have 
c-er&B*2 activation,*"'"'*® especially if larger cells are present. The greater fre- 
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quency of e-er&B-2 protein overproduction in comedocarcinoma in situ, com- 
pared with infiltrating ductal carcinoma, could be explained by the feet that 
many infiltrating ductal carcinomas arise from other types of intraductal carci- 
noma, which show c-erbB-Z activation infrequently. Others have speculated 
that carcinoma in sitti with activation tends to regresH or to lose e- 

erbB'2 activation during progression to invasion, ^''■^■^ Infiltrating and in situ 
components of ductal carcinoma, however, usually ai'e similar with respect to c- 
€rb3-2 activation,^i'» althouj^ some authors have noted more heterogeneity of 
the immunohistochemical staining pattern in invasive thah in in situ carci- 
noma, Activation of.c-er&B-2 is infrequent in lobular carcinoma in situ. If 
lesions contain more than one histological pattern of carcinoma in situ, the c- 
erfcB-2 protein overproduction tends to occur in the comedocarcinoma in situ 
but may include other areas of carcinoma in situ.*^'^" Overproduction of c- 
erbB'& protein in ductal carcinoma in situ correlates with larger cell size and a 
periductal lymphoid infiltrate.^ 

Activation of c-erfoB-2 has not been identified in benign breast lesions, 
including fibrocystic disease, fibroadenomas, and radial scars (Table 2). Strong 
membrane immunohtstochemicBl reactivity fi>r c-er2rB-2 has not been described 
in atypical ductal hyperplasia, although weak accentuation of membrane staining 
has been noted infrequently.^^^s* normal breast tissue, o-eriB-2 DNA is 
diploid, and C'erfeB-2 is expressed at lower levels than in activated tumors,^**^®.Ba 

These preliminary data suggest that c-erbB-2 activation may not be useful 
ibr resolving many of the common problems in diagnostic surgical pathology. For 
example, c-erfeB-2 activation is infrequent in tubular carcinoma and radial scars. 
In addition, because o-erbB-2 activation is unusual in atypical ductal hyperplasia, 
cribriform carcinoma in situ, and papillary carcinoma in situ, detection of c-erfcB- 
2 activation in these lesions may not be helpfril in their difierential di^osis. The 
histological filatures of comedocardnonia in situ, which oommonly overproduces 
c-er&B-2, are unlikely to be mistaken for those of beni^ lesions. Activation of 

TABLE 2. C-wfeB^ ACTtVATTON \H BENIGN HUMAN BREAST USiONS 



Histological Diagnosis AmpllficBtfon* Overproduction Oveiprocluctlon 



0«fM-2DNA 



e^B-2mRNA 



e-erM-2 Protein 



Fibrocystic disease 
Atypical ductal tiyperplasia 



0/10» 



2(waak)/2l» 
1(cytopia5mio}/l3» 

0/4» 

0/21.»0/10.«* 

o/e.»o/a« 

0/22^ 



B&\\gn ductal hyperplasia 
Scteroeing adenosis 
Fibroadenomas 



0/16 » 0/6 « 
0/a,wOrt«i 



0/6 » 0/33* 



Radial scars 
Blunt dud adenosis 
'Breasl mastosls* 



0/3" 



•Shown B8 numbor d easos wiUi acQvettonAnumber of cases studtod; ni^mcQ to given a» a superscripL 
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c-6rbB-2, however, does fevor infiltrating ductal carcinoma over infiltrating 
lobularcarcinoma. Further studies of these issues \rould be useful. 

Correlation of c-erAB-2 Activation With Pathologic Prognostic Factors 

Multiple studies have attempted to correlate c-eriB-2 activation with various 
pathologic prognostic factors (Table 3). Activation of c-€r&Br2 was correlated 
with lymph node metastasis in 8 of 28 series, with higher histological grade in 6 
of 17 series, and with higher stage in 4 of 14 series. Large tumor size was not 
associated with activation in most studies (II of 14). Tetraploid DNA 

content and low proliferation^ measured by Ki*€7» have been suggested as 
prognostic fitctors and may correlate with c-er&B-2 activation.^^ 

Correlation of c^rbB^ Actuation WHh Clinical Prognostic Factors 

Various studies have attempted also to correlate activation with clinical 

features that may predict a poor outcome (Table 4). Activation of c-erbB-2 
correlated with absence of estrogen receptors in 10 of 28 series and with ab- 
sence of progesterone receptors in 6 of 18 series. In most studies, patient age 
did not correlate with c-er&B-2 activation, and, in the rest of the reports, c- 
er6B-2 activation was associated with either younger or older ages. 

Correlation of c-efdB-2 Activation With Patient Outcome 

Slamon et al^^ first showed that amplification of the c-er&B-2 oncogene inde- 
pendently predicts decreased survival of patients with breast carcinoma. The 
correlation of c-erbB-2 amplification with poor outcome was nearly as strong as 
the correlation of number of involved lymph nodes with poor outcome. Slamon 
et al also reported that c-er&B-2 amplification is an important prognostic indica- 
.tor only in patients wi& lymph node metastasis.''^^^ 

A large number of subsequent studies also attempted to correlate c-erfrB-2 
activation with prognosis (Table 5). In 12 series, there was a correlation be- 
tween oerbB'2 activation and tumor recurrence or decreased survival. In five 
of these series, the predictive value of c-sr&B-2 activation was reported to be 
independent of o^er prbgnbstic factbrs. In contrast, 18 series <lid not confirm 
the correlation of c-erfeB-2 activation with recunence or stuvival. Fbur possible 
explanations for this controversy are discussed below. 

One problem is that c-^bB-2 amplification correlates with prognosis 
mainly in patients with lymph node metastasis. As summarized in Table 5, most 
studies of patients with axillary lymph node metastasis showed a correlation of 
c-erbB"! activation with poor outcome. In contrast, most studies of patients 
without axillary metastasis have not demonstrated a correlation with patient 
outcome. Table 6 summarizes the studies in which all patients (with and with- 
out axillary metastasis) were considered as one group. There is a trend for 
studies with a higher percentage of metastatic cases to show an association 
between c-er&B*2 activation and poor outcome. Thus, most of the current 
evidence suggests that c-er&B-2 activation has prognostic value only in patients 
with oietastasis to lymph nodes. 
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TABLE 5. CORRELATON OF e^bM ACTIVATION WITH OUTCOME IN PATIENTS 
WITH BREAST CARCINOMA 



Numtef ofPatiants 



Type of H/totavfaaUsto 









Axfliary 


. NO 


StOtfStlCBl 






Activation^ 


Toler 


l^mph Nodes 




Arutfyaily 


Refefenco 


<0.05 


DNA 


176 






u 

m 


or . 


<6.06 


DMA 


61 






II 
U 


W 


<0.05 


DNA 


57 






[l 
v 


ec 
TO 


<0.06 


DNA 


41 






11 
u 


OQ 

v9 


<0.05 


RiBNA 


62 






11 
u 






Protein 


102 






M 

nn 


lUl 


<0.05 


DNA 




345 




M 

Ml 


Dl 


<0.05 


DNA 










17 


<0.05 


DNA 




91 




tl 


eey 
Of 


<0.05 


DNA 




86 




M 


70 

fa. 


<0.05 


ProterrvWB 




350 




M 


85 


<0.05 


Protein 




62 


44 


U 


101 


0.05-0.15 


DNA 


57 






U 


111 


0.O5-O.15 


Protein 


1B9 






M 


92 


0.05-0.15 


Protein 




120 




U 


86 


>0.15 


DNA 


lao 






U 


113 


>0.15 


DNA 


122 






M 


4 


>0,15 


DNA 


50 






U 


44 


>0.15 


mRNA 


57 






U 


50 


>0.15 


Protein 


280 






M 


68 


>ai5 


Pfotein 


195 






U 


11 


>0.15 


Protein 


102 






U 


39 


>0.15 


Protein 




137 




U 


17 


>ai6 


DNA 






181 


M 


81 


>0.15 


DNA 






159 


U 


17 


>0.15 


DNA 






73 


U 


87 


>0.15 


Proteln-WB 






378 


u 


86 


>0.15 


Pro1eln*Wa 






102 


u 


17 


>0,15 


Protein 






141 


u 


86 


>0.15 


Protein 






41 


u 


40 



"The endpolnts of thes9 studlaa worn tumor rocunom 

erbB-z acthmtton and a poorer paUent outcomo Is slatisUcally efgniftcant at <0.05. 19 «f equlvoeal'BtonlRcBnDe 
el 0,05 to 0.1 6< and Ib not fiignlflcairt at >0'.1 5. 

^Shown as VB(iat>la measured. Lettera -WB" Indicate assay by Western bhrt; the ottier protein studies used 
(mmunohlstochamfcal methods. 

» muHlvariBte statistical analysis; U » unlvariale staUsttcal anaVsle. 
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TABUe 6. PERCEhTTAGE OF BREAST CARCINOMAS WITH METASTASIS COMPARED 
WTH PROGHOSTIC SIGNIFICANCE OF G-erbB-Z ACTIVATION 



%of tumors wim 
fymph node 
metastasis In 
each study 



70- 



60- 



60- 



40- 



71 (DNA)» 



61 (ONA)a 

69(DNA)« 
58(Prote]nV" 



64(DNA)if« 



42(Proleln)82 



P<0.05 



+ 



0.0S<P<0.1S 



64(mRN/Q» 
^1(DNA)^ 



58 (ONA)^ 
57(DNAr« 
55(Proteln)» 

46(Protein)i* 
46(Protein)» 



H 

p>o:i6 



P for oorrelatton of o-er6B-2 aotivaOon w]th patient outcome. 

Each stucVs percentage of breast carofnornas ttRtti metestaslB ia compemd wHh tho oorrelslion between o- 
a/684! aotAration and outcome. These data inolud90%tlK)ses^ 

cancer paHeiKs. whether or not they had axMaty malastaals. Supersoitpta are iha refeienoea. bi inranttieses 
are the ^fpw of c-erb&-2 activation. P values are interpreied as Gn Table 3, 



A second problem is that various types of breast carcinoixia are grouped 
together in many survival studies. Because the current literature suggests that 
c-€rfcB-2 activation is infrequent in lobular carcinoina, studies that combine 
infiltrating ductal and lobular carcinomas may dilute the prognostic effect of c- 
er&B-2.activation In ductal tumors. In addition, most studies do not analyze 
inflammatory breast can^nozna separately. This condition frequently shows c* 
erbB-2 activation and has a worse prognosis than the usual mammary carci- 
noma, but it is an uncommon lesion. 

A third potential problem is the paucity of studies that attempt to correlate 
c^erbB-Z activation wdth clinical outcome in subsets of breast carcinoma without 
metastasis. Two recent abstracts reported that in patients without lymph node 
metastasis who had various risk factors for recurrence (such as large tumor size 
and absence of estrogen receptors), overexpression predicted early 

recurrence.23.BT patients with ductal carcinoma in situ, one small study found 
no association between tumor recurrence and c-erbB'2 activation.*^ 

A fourth problem is the lack of data regarding whether the prognosis 
correlates better with c-^rfeB-2 DNA amplification or with mRNA or protein 
oveiproduction. Most studies that find a correlation between o^6B-2 activa- 
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tion and poor patient outcome measure c-eriB-2 DNA amplification (Table 5), 
and breast carcinoma patients with greater amplification of c-cr6B-2 may have 
poorer survival '^ ^^ Recent studies suggest diat amplification has more prognos- 
tic power than overproduction," ^4,35 but the clinical si^iflcance of c-erfcB-2 
overproduction without DNA amplification deserves fialher research. Few 
studies have attempted to correlate patient outfcome with <yefbh-2 mRNA 
overproduction, and many studies of c-ertB*2 protein overproduction use rela- 
tively less reliable metliods such as immunohistochcnnical studies with poly- 
clonal antibodies. 

Comparison Of Activation With Other Oncogenes in 

Breast Carcinoma 

Other oncogenes that may haye prognostic implications in human breast cancer 
are reviewed cl$cwhere."ao6 ijis section will be restricted to a comparison 
between the clinical relevance of c-er&B-2 and these oriier ona>genes. 

Hie c-mi^c gene is often activated in breast carcinomas, but c-myc activa- 
tion generally has less prognostic importance than c-erfeB-2 activation,"*^'" ^''** 
One study found a correlation between Increased mRNAs of c^i?B-2 and c- 
myc, althougli other reports have notconiinned this,«.io6 Subsequent research, 
however, could demonstrate a subset of breast cardnomas in which omyc has 
more prognostic importance than o-eri>B-2, 

The gene c-crfeB-1 for the epidermal growth fector receptor (EGFB) is 
homobgous with c-^rbB-2 but is infirequently amplified in breast cardnomas.'^ 
Overproduction of EGFR, however, occurs more firequently than amplification 
and may correlate with a poor prognosis. In studies that have examined both c- 
erbE'2 and EGFR in the same tumor, c-erfeB-2 has a stronger correlation wift 
poor prognostic fectors Studies have tended to show no coxrehtion between 
amplification of c-er&B-2 and c-ar^B-l or overproduction of c-er2»B<2 and EGFR, 
although at the molecular level EGFR mediates phosphorylation of c^rAB-2 
protein. 5i'«^«i'«8*Aoo Recent reviews describe EGFR in breast carcinoma. 

Hie genes c-erbA and ear^l are homologous to the thyroid hormone recep- 
tor, and they are located adjacent to c-er^^B-E on chromosome 17. Hiese genes 
are frequently coamplified with c-erbB-2 in breast carcinomas* The absence of 
c-erbA expression in breast carcinomas, however, is evidence against an impor- 
t^t role for Ais gene in breast neoplasia.*** Amplification of &«rbB*2 can occur 
without ear-1 amplification, and these tumors have a decreased survival that is 
similar to tumors with both c-erfcB-2 and ear-1 amplification."^ Consequendy, 
c-er&B-2 amplification seems to be more important than amphfication ofc-erbA. 
or ear-h 

Other genes also have been compared with c-er^B-2 activation in breast 
carcinomas. One study fi^und a significant correlation between increased o-crfeB- 
2 mRNA and increased mRNAs otfos, platelet-derived growth factor chain A, 
and Ki-rasJ« Allehc deletion of c-Ha-roAr may indicate a poorer prognosis in 
breast carcinoma^^i but it has not been compared with G-er6B»2 activation. Some 
studies have suggested a correlation between advanced stage or recurrence of 
breast carcinoma and activation of any one of seveial oncogenes.^^ 
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ACTIVATION OF c-er6B*2 IN NON-MAMMARY TISSUES 

Incidence of c-erdB-2 Activation In Non-Mammary Ttasues 

Table 7 summarizes the noma! tissues in which c-erbB-i expression has been 
delected, usuaOy with ixmnuiiohistocheinical methods using pdydonal anti- 



TABLE 7. PRESEKKCE OR ABSENCE OF c-«iM-2 mRNA OR oerbB^ PROTEIN IN 
NORIIAL HUMAN TISSUES 



Tissues With 
mRNA 



Ttesues Producing 
o-efbB-2 Protein* 



Ttasueo Laeking 
c^r2)B*2 mRNA 



Tissues Lacking 
c-erijM Protein 



Skin" 



Stomach^* 

Jejunum** 

Colon" 

Kldnsya< 



Uver« 



Fetal brain<» 

Thyroid* 
titerus^ 



Placemen 



Epidermis" 
External root sheath^ 
Ecorlne sweat gtand^ 
Fetal oral mucosa^ 
Fetal esophagus^ 
Stomach^ 
Fetal Intesttne"^ 
Small Int^ne^^ 
Coton»« 
Fetal Mdneytti 

Fetal proximal tutwlett 
DIslal tubule^z 
Fetal collecting duct^ 
Fetal renal pelvis" 
Fetal ureter« 
Hepalocytes^ 
Pancreatic atini" 
Pancreafic du^e^^ 
Endocrine cells of Isleis 
of Langertiens^ 

^ Fetal, trachea^.. 
' Fetal bronc^les^ 
Bronchioles^ 



Fetal ganglk>n oalls^ 



KMneyaiM 



Ovary'* 
Bloodvessels^ 



Postnatal oral mucosal 
Postnatal esophagus"^ 



Glorneiulus^ 

Postnalai Bowman's capsule^ 
Postnatal proximal tutHile"^ 

Postnatal collecting duct« 
Postnatal renal paMs^B 
Postnatal fetal uretei«> 
Uver«w 



Pancreatic Islets^ 

Postnatal tracheav^ 
Postnatal tMontMoIestt 

Postnatal alveoll^^ 
Postnatal brBin>> 
Postnatal ganglion oells^ 



Endothelrum« 

Adfonacorlioalcelis^ 
Postnatal thymus® 
Fibroblasts^ 
Smooth muscle celts^ 
Carcfiac muscle celis^ 



This piotefn study used Western blots; tha lesiused mmunohlslochemlcQl msthods. 
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bodies. Only a few studies have been performed, and some of these do not 
demonstrate convincing ceU membrane reactivity in the published photo- 
graphs. The interpretations in these studies/ however, «re listed, with the 
caveat that Qiese findings should be confirmed by immunoprecipitation or ' 
Western or BNA blots. Production of c-erbB-2 has been identified in normal 
epithelium of the gastrointestinal tmct and skin. Discrepancies regarding c- 
eriB-2 protein in other tissues could be due, at least in part, to di£Ferences in • i 

techniques. \ 

The data on c-erbB>2 activation in various non-mammary neoplasms 
sho\ild be interpreted with caution, because only smaD numbers of tumors have 
been studied, usually by immunohistochemical methods using polyclonal anti- 
bodies. Studies using cell lines have been excluded, because oeQ culture can 
induce amplificatioh and overexpression of other genes,..alfhough this has not 
been documented for c*€rZ^B-2. 

Activation of c*eriB-2 has been identified in 32 percent (64 of WS) of 
ovarian carcinomas in eight studies (Table 8). One abstract^ stated that ovarian 
carcinomas contained significantly more c-erbB-2 protein than ovarian non- 
epithelial malignancies. Another report^* showed that 12 percent of ovarian 
carcinomas had c-erbB-Z overproduction without amplification. 

Activation of c-erbB-2 has been identified in 20 percent (40 of 198) of 
gastric adenocarcinomas in seven studies, including 33 percent (21 of 64) of 



TABCea, c^rbB-2 ACTIVATION IN HUMAN GYNECOLOGIC TUMORS" 















mRNA 


Protein 




c-QrdB-2 DNA 


Over- 


Ovcr- 


Tumor Type 


Ampimcallon 


produeU6n 


produeUon 


Ovafy-^dnDina, not otherwise 


31/120,«1/11 w 


23/67" 




Bpedfled 


Q/2»0/1ito 




36/72^1 


Ovary— serous (papillary) carcinoma 


2^,110 i/7,»i2 0/6» 






0\ffitfy— ^ndbmotriold carcinoma 








Ovary— mucinous carcTnoma 


W>0/1« 






Ovary— clear cell carcinoma 


0«,^«0/1^ 






Ovary— mixed epHheflal carcinoma 


o/zw 






Ovary— endometrioid borderline lumor 


0/1" 






Ovary— mucinous borderline tumor 


0/372 






Ovary— serous cyatadenoma 


0/4" 






Ovary— mudnous cystadenoma 








Ovary— sderosing stromal tumor 


0/1« 






Ovary— flbrothecoma 


0/1^ 






Uterus— endometrlaf adanocarclnoma 


0/4«0/1"o 







'Shown as number of casea wim ampnncstlon (or ovarpnoductlonVlotai numbar or casas aUxfied: reterenca is 
given as suparaorlpL All pretain studies used Immunohistochemical methocfs. 
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intestinal or tubular subtypes and 9 percent (4 of 47) of diffuse or signet ring cell 
subtypes (Table 9). Activation of has been detected in 2 percent (6 of 

281) of colorectal carcinomas, althougb an additional immunohistochemical 
study detected e^r&B-2 protein in seven of ei^t tissues fixed in Bouin s solu- 
tion. One study found greater immunohistochemical reactivity for c-BfiB-2 
protein in colonic adenomatous polyps than in the adjacent normal epithelium, 
using Bouin s fixative. Lesions with anapIasti^B features and progression to itival 
siv© carcinoma tended to show decreased immunohistochemical reactivity for c- 
erfeB.2 protein » Hepatocellular carcinomas (12 of 14 cases) and cholangiocarci- 
nomas (46 of 63 cases) reacted with antibodies against o-et*B~2 in one study, but 
some of these "positive" cases showed only diffuse cytoplasmic staining, which 

TABLE 9, ACTlVATiON IN HUMAN GASTROINTESTiHAL TUMOR S' 

i>eJtiB-2 
Pitotein 





eHBlW-2DNA 


Over- 


Tumor Type 


Ampltncation 


pioducUon 


EGophagus— squamous cell cardnoma 


0/1 iw 


Q/l« 


Stomach— carcinoma, poorly differentfaiad 






Slomaeb-^enocarcinoma 


2/24 «2/9,'w 2/8,111 






2/8WQ/l*M 




Stomach— carcinoma. Intestinal or tubular type 


5/1 0^« 


16/64» 


Stomach^arclAomai cfrffuse or algnet ring cell type 


0/2toB 




ColoiBctum— carcinoma 


2/49,M l/45,"« 


1/22," 7/8** 




1/45,«1/45« 




0/40."0/3E,i«'0/3« 




Colon^vlilous adenoma 


0/15P 




Coton — tubulovlUous adenoma 






Colon— tubular adenoma 




1S/10a» 


Coion-^hyperplastlo polyp 


Q/lflO 




(ntesfine^-^-lelomyosarobma 






Hepatocellular carcinoma 


0/12111 


12/14.«Q/2«i 


Hepatoblastoma 


0/1W 




Cholanglocardnoma 






Pancreas— adenocarcinoma 




2/B0,*te0/2« 


Pancreas— acinar caitlnoma 






Pancreas— dear cell carcinoma 




Offiii 


Pancreas— large cell carcinoma 




0/3*1 


Pancreas— signet ring carcinoma 




0/141 


Pancreas-H:hronlc Inflammation 







•Shown as number of cases with ampfiflcaton (or overproductJonyiotal mimtiar of cases cUxfled: reteranoe Is 
I given as superscript All protein studies used Immunohlstochemioal methods. No studies an^ed (6r o-e/tB- 

2mRNA. 

^(ssues llxQd in Souln's eohJtlon. 

eOnV cases M^h distinct membrane staining are Interpreted as showing o^B-2 overproduction. 
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YA3Lc 10. 0€rf?9>2 ACTVATiOM 8?^ HUMAM PULMOWRY TURiORS^ 



Tumor Typa 


o«n>B-2 0NA 
Ampllflcallon 


c^B*2 
Protein 
Overproduction 


Non-small cell carcinoma 


2/eo,«»Q^eoB' 


1/84«» 


Epidermoid cardnoma 


0/1 3,82 0/1 0,^0/6» 


3/6W 


Adenocarcinoma 


0/21 w 1/13 » 0/7,?" 0/7 w 0/3IW 


4/12* 


Urge ceil ca/clnoma 






Smafi can carcinoma 




0/26,»0^ 


Carcinoid tumor 


0/1« 


.0/^ 



■Shown as number of oasas with ampllfioation (or overproductlonVtotai number til caaes eluded; reference Is 
given as eupeiBCt^ All protein stucfies used Immufiohlslochamlee) mattioda. Ho studies aneryzBd for o«6B- 
2mRNA. 



does not indicate c-cr&B-2 activation in breast neoplasms.^ Also, some pancre- 
atic carcinomas and chronic pancreatitis tissue had cytoplasmic immuhohlsto- 
chemical reactivity for o-ar&B-2 protein, in addition to the xare case of pancre- 
atic adenocarcinoma with distinct cell membrane staining.^ 

Tables 10 through 14 summarize the studies of c^rbB-2 activation in other 
neoplasms. The c-erfcB-2 oncogene is not activated in most of these tnmors^ 
ActivaHon of c.firiB-2 has been detected in 1 percent (4 of 299) of pubnonary 
non-small cell carcinomas in nine studies, although oiie additional report^ 
found C'erbB'2 protein overproduction in 41 percent (7 of 17). Renal cell carci- 
noma had c-erfeB-2 activation in 7 percent (2 of 30) in four studies. Overproduc- 
tion of c-criB-2 protein was described in one transitional cell carcinoma of the 
urinary bladder, a grade 2 papiOary lesion.^ Squamous cell carcinoma and basal 
ceD carcinoma of the skin may contain c-«r&B-2 protein, but It is not dear 



TAB1£11. c^bB^ ACTIVATION IN HUMAN HEMATOLOGIC PROUPERATIONS" 







e<«fbB*2 


c-erbB-2 






mRNA . 


Protein 


Tumor Type 


e<«rM-2DNA 


Ovefi 


Over- 


Amplification 


production 


production 


Hematolosic mallgnandas 


0/23111 






Mal^anl lymphoma 


O/d.s'Q/S'w 


0/1' 


0/15^ 


Acute leukemia 


0/1 46^ 






Acute lymphoblastic leukemia 


0/1 w 






Acute myeloblastic touicemla 


Q/3W 






Chronic leukemia 


o/iow 






Chronic lymphocyUo leukemia 


0/61W 






Chronic myelogenoua leukemia 








Myeloproliferative dteorxier 


0/1» 







; 



«Shown as mimbsr of oaass wlih ampfifteatJon {or ovarpTOduceor»)/lotal number of eases studied; retorance Is 
given as suparscrfpl. All prole(n studies used Immunohtetochemlea) methods. 
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TABLE la, 0€ffaB>2 ACTIVATIOM IN HUMAN TUMORS OF SOFT TISSUE AND BONE> 





0-ef/»B-2 0NA 


Tumor Type 


AmpliflcaUon 


Sarcoma 


0/10,"' 0/B» 


Malignant fibrous histiocytoma 


0/1 w 


Uposarcoma 




Pteomofphic aareoma 


0/1107 


Rhabdomyosarcoma 


0/1 tQ7 


Odteogenlo sarcoma 




Chondrosarcoma 


0/1167 


Swing's sarcoma 




Schwannoma 


0/1*' 



-Shown as number of cases with ampllficaUon (or ovarpmdtidlanVtatal numbw of casn siudtotf: refftrenoa Is 
given as superscript. No studies analyad for crmbB-Z mRNA or owtM protein. 



whether the pTX)tein level is increased over that of normal sldn » Thyroid 
carcinomas and adenomas can have low leveb of increased c-eriB-2 mBNA. 
One abstract, described low-level DNA amplification In one of ten 

salivary gland pleomorphic adenomas.^ 

Correlation of e^rt)B-2 AcHvatlon With Patient Outcome 

Very few studies have attempted to correlate c-er6B-2 activation in non- 
mammary tumors with outcome.- Slamon et aP' showed that c-«rfcB-2 amplifica- 
tlon or overexpression In ovarian carcinomas correlates with decreased survival, 
especially v/hen marked activation is present. However, they didnot report the 
stage, histological grade, or histological subtype of these neoplasms. Another 
study of stages III and IV ovarian carcinomas fbimd a correlation between 
decreased survival and c-eriB-2 protein overproduction, but not between sur- 
vival and histological grade. ^ One abstract slated that c-er!?B*2 protein ovexpro- 
duction in 10 of 16 pulmonaiy adenocarcinomas correlated with decreased 
disease-firee interval.^ Another abstract described a tendency fiir immunohisto- 



TA8LE 13, ^rbB-l AOTIVATiON IN HUMAN TUMORS OF THE URINARY TRACT- 



Tumor Type 


CH&r6B-2 0NA 
Amplifleaiion 


mRNA 
Over- 
production 


Protein . 
Over* 
production 


Wdney— renal cell carcinoma 


1/6,6r 1/v«0/5M 


O/16'W 




Wilms' tumor 








Pfoslata— adenocarcinoma 






0/2388 


Urinary bladder— carcinoma 






1/46» 



■Shown as number of ca&eB wrfih amplfficatlon (or overproduclton)/tolal number of cases stucfled: raferenoe Is 
givon aa auperscrlpt. Afl protein atudiaa used immuruhlstochemieal methods. 
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TABLE 14. t>9/»MACTVATI0MiNMtSC£LUWE0US HUMAN TUMORS- 



Tumor Type 



frwbM Protein 
ONA c-er6&-2mRNA Over* 

AmplWfcatlon Owrproduotton praductlon 



Skin— malignant melanoma 
Skin» head and neck — squamous 
cell carcinoma 

Site not elated-^squamous cell 
carcinoma 

Salivary gland— adenocarcinoma 
Parottd gland— adenoid cyeUc 
carcinoma 

TKyioW— anaplastic carcinoma 
ThyroW— papillary carcinoma 
Thyroid-^^denocarclnoma 
TTiyrold— adenoma 
Neuroblastoma 
Meningioma 



O/B.»^0!/2» 



1/1 » 



0/11 

0/5' 

0/1M 

o/35« mj^w^ 



0/1 0» 



0/1M 



3(towlevels)/5^ 
1(towlevBl6)/2« 



et^. « A^^^ ampilRcaUon {or ovorprodueUoi^ numbar or cases studied: referBnoa te 

given as fiuperBoript All protein 8tiidtesU89dlrwnunoW^^ 



chemical reactivity for o-criB-S protein to correlate with higher grades of pros^ 
tatic adenocarcinoma.^^ Additional prognostic studies of ovarian carcinomas and 
omer neoplasms are needed. 



SUMMARY 

Activation of the oncogene can occur by amplification of c-crM-S 

DNA and by overproduction of c-erfcB.2 mRNA and protein. Appraxi- 

k of breast carcinomas show evidence of c-erfeB.2 activation, 

Mrhich correlates with a poor prognosis primarily in patients with metastasis to 
axUlary lymph nodes. Studies that have attempted to correlate c-erbh^2 activa^ 
tion with other prognostic factors in breast carcinoma have reported conflicting 
conclusions. The pathologic and clinical significance of c^r*B-2 activation in 
other neoplasms is unclear and should be assessed by additional studies. 
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I, Paul Polakis,Ph,D., declare and say as follows: ) 

1. i was awarded a Ph.p. by the Department of Biochemistry of the Michigan 
State University in 1 984. My scientific Curriculum Vitae is attached to and fonns 
part.of this Declaration (Exhibit A). 

Z. I am currently employed by Genentech, Inc. where my job title is Staff 
Scientist S ince joining Genentech in 1999, one of my p^rimaiy responsibilities has 
been leadmg Genentech's Tumor Antigen Project, which is a large research project 
with a pnmary focus on identifying tumor cell markers that find use as targets for 
both the diagnosis and treatment of cancer in humans. 

3 As part of the Twnor Antigen Project, my laboratory has been analyzing 
differential expression of various genes in tumor cells relative to normal cells. 
The purpose of this research is to identify proteins, that are abundantly expressed 
on certain tumor cells and that are either (i) not expressed, or (ii) expressed at 
lower levels, on conresponding normal cells. We call such differentially expressed 
proteins "tumor antigen proteins", When such a tumor antigen protein is 
identified, one can produce an antibody that recognizes and binds to that protein. 
Such an antibody finds use in the diagnosis of human cancer and may ultimately 
serve as an effective theirapeutic in the treatment Of human cancer. 

4. In the course of the research conducted by Genentech's Tumor Antigen 
Project, we haive employed a variety of scientific techniques for detecting and 
studymg differential gene expression in human tumor cells relative to normal cells 
at genomic DNA, mRNA and protein levels. An important example of one such 
technique is the well known and widely used technique of microarray analysis 
whjch has proven to be extremely usefiil for the identification of mRNA molecMes 
that are differentially expressed in one tissue or cell type relative to another. In the 
course of our research using microarray analysis. We have identified 
approximately 200 gene transcripts that are present in human tumor cells at 
significantly higher levels than in corresponding normal human cells. To date we 
have generated antibodies that bind to about 30 of the tumor antigen proteins ' ' 
expressed fix)m these differeiitially expressed gene transcripts and have used these 

antibodies to quantitatively determine the level of production of these tumor 
antigen proteins in both human cancer cells and corresponding normal cells We 
have then compared the levels of mRNA and protein in both the tumor and normal 
cells analyzed. 

5. From the mRNA and protein exi)ression analyses described in paragrjtoh 4 
above, we have observed that there is a strong correlation between changes in the 
level of mRNA present in any particular cell type and the level of protein 



expressed from that mRNA in that cell type. In approximately 80% of our . 
observations we have found that increases in the level of a particular ttiRNA 
correlates with changes in flie level of protein e5q)ressed from that mRNA when 
human tumor cells are compared with their corresponding normal cells. 

6. Based upon my own experience accumulated in more than 20 years of 
research, including the data discussed in paragraphs 4 and 5 above and my . 
knowledge of the relevant scientific literature, it is my considered scientific 
. opinion that for human genes, an increased level of riiRNA in a fumor cell relative 
. to a normal cell typically correlates to a similar increase in abundance of the 
encoded protein in the tumor cell relative to the normal cell. In fact, it remains a 
ceoitr^ dogma in molecular biology that increased mRNA levels are predictive of 
corresponding increased levels of the encoded protein. While tiiere have been 
published reports of genes for which such a correlation does not exist, it is my 
opinion that such reports are exceptions to the commonly understood general rule 
■ that increased mRNA levels are predictive of corresponding increased levels of the 
encoded protein. 

7- I hereby declare that all statemaits made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be true, 
arid ftirther that these statements were made with tiie knowledge tiiat.willfril &lse 
statements and the like so made are pvu^ishable by fine or imprisonment, or both, 
under Section 1001 of Title 18 of the United States Code and that such willfiil 
statements may jeopardize the validity of the application or any patent issued 
thereon. 
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5^a{tr. cts. If these minor cell proteins differ among cells to the same extent as the 
' ^^Q^e abundant proteins, as is commonly assumed, only a small number of pro- 
tein differences (perhaps several hundred) suffice to create very large differences 
in ceU morphology and behavior, 

A Cell Can Change the Expression of Its Genes 
in Response to External Signals ^ 

Most of the specialized cells in a multicellular organism are capable of altering 
their patterns of gene expression in response to extracellular cues. If a liver cell 
is ejq)osed to a glucocorticoid hormone, for example, the production of several 
specific proteins is dramatically increased. Glucocorticoids are released during 
periods of starvation or intense exercise and signal the liver to increase the 
production of glucose from amino acids and other small molecules; the set of 
proteins whose production is induced includes enzymes such as tyrosine amino- 
transferase, which helps to convert tyrosme to glucose. When the hormone is no 
longer present, the production of these proteins drops to its normal level. 

Other cell types respond to glucocorticoids in different ways. In fat cells, for 
example, the production of tyrosine aminotransferase is reduced, whUe some 
o&er cell types do not respond to glucocorticoids at all. These examples illiistrate 
a general feature of cell specializatioiv^-different cell types often respond in dif- 
ferent ways to the same extracellular signal. Underlying this specialization are 
features that do not change, which give each cell type its permanently distinc- 
tive character. These features reflect the persistent expression of different sets of 
genes. 



Gene depression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNA to Protem ^ 

If differences between the various cell types of an organism depend on the par- 
feito genes that the cells ^ress, at what level is the control of gene expression 
exercised? There are many steps in the pathway leading from DNA to protein, and 
all of them can in principle be regulated. Thus a ceU can control the proteins it 
makes by (1) controlling when and how often a given gene is transcribed (tran- 
scriptional control), (2) controlling how the primary RNA transcript is spliced or 
othemise processed (RNA processing control), (3) selecting which completed 
mRNAs in the cell nucleus are exported to the cytoplasm (RNA transport con- 
trol), (4) selecting which mRNAs ui the cytoplasm are translated by ribosomes 
[tiranslatlonal control), (5) selectively destabilizing certain mRNA molecules in 
tHe cytoplasm (mRNA degradation control), or (6) selectively activating, inacti- 
vatmg, or compartmentalizing specific protein molecules after they have been 
inade (protein activity control) (Figure 9-2). 

For most genes transcriptional controls are paramount. This makes sense 
ecause, of lall the possible control points illustrated in Figure 9-2, only transcrip- 
aonal control en$ures that no superfluous intennediates are synthesized. In the 




\|™5crii)tron8l 



Fi^e 9-2 SiK steps at which 
eucaryote gene expression can be 
controlled. Only controls that operate 
at steps 1 through 5 are discussed in 
this chapter. The regulation of protein 
activity (step 6) is discussed in 
Chapter 5; this includes reversible 
activation or inactivation by protein 
phosphorylation as well as 
irreversible inactivation by proteolytic 
degradation. 
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following sections we discuss the DNA and protein components that regulate the 
initiation of gene transcription. We return at the end of the chapter to the other 
ways of regulating gene expression. 

Summary 

The smome of a cell contains in its DNA sequence the information to make many 
thousands of different protein and RNA molecules. A cell typically expresses only d 
fraction ofits genes, and the different types of cells in multicellular organisms arise 
because different sets of genes are expressed Moreover, cells can change the pattern 
of genes th^ egress in response to chan^ in their environment, such as sigFudsfrom 
other cells. Although all of the steps involved in expressing a gene can in principle be 
regulated, for most genes the initiation of RNA transcription is the most important 
point of control 



DNA-binding Motifs in Gene 
Regulatory Proteins ^ 

How does a cell determine which of its thousands of genes to transcribe? As dis- 
cussed in Oiapter 8, the transcription of each gene is controlled by a regulatory 
region of DNA near the site wjtiere transcription begins. Some regulatory regions 
are simple and act as switches that are thrown by a single signal. Other regula- 
tory regions are complex and act as tiny microprocessors, responding to a vari- 
ety of signals that they interpret and integrate to switch the neighboring gene on 
or off. Whether complex or simple, these switching devices consist of two fun- 
damental types of components: (1) short stretches of DNA of defined sequence 
and (2) gene regulatory proteins tfiat recognize and bind to them. 

We begin our discussion of gene regulatory proteins by describing how these 
proteins were discovered. 



Gene Regulatory Proteins Were Discovered Using 
Bacterial Genetics ^ 

Genetic analyses in bacteria carried out in the 1950s provided the first evidence 
of the existence of gene regulatory proteins that turn specific sets of genes on 
or off. One of these regulators, the lambda repressor, is encoded by a bacterial 
virus, bacteriophage lambda. The repressor shuts off the viral genes that code for 
the protein components of new virus particles and thereby enables the viral ge- 
nome to remain a silent passenger in the bacterial chromosome, multiplying with 
the bacterium when conditions are favorable for bacterial growth (see Figure 
6-80), The lambda repressor was among the first gene regulatory proteins to be 
characterized, and it remains one of the best understood, as we discuss later. 
Other bacterial regulators respond to nutritional conditions by shutting off genes 
encoding specific sets of metabolic enzymes when they are not needed. The lac 
repressor, for example, the first of these bacterial proteins to be recognized, turns 
off the production of the proteins responsible for lactose metabolism when this 
sugar is absent firom the medium. 

The first step toward understanding gene regulation was the isolation of 
mutant strains of bacteria and bacteriophage lambda that were unable to shut 
off specific sets of genes. It was proposed at the time, and later proved, that most 
of these mutants were deficient in protfeins acting as specific repressors for these 
sets of genes. Because these proteins, like most gene regulatory proteins, are 
present in small quantities, it was difficult and time-consuming to isolate them. 
They were eventually purified by fractionating cell extracts on a series of stan- 
dard chromatography columns (see pp. 166-169). Once isolated, the pro- 
teins were shown to bind to specific DNA sequences close to the genes that they 



Figure 9-3 Double-heUcal structure 
of DNA. The major and minor grooves 
on the outside of the double helix, a'* 
indicated. The atoms are colored as 
follows: carbon, dark Wmc; nitrogen 
light blue; hydrogen, white; ovi^ 
red; phosphorus, yellow* 
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Figure 9-71 A mechanism to explain 
both the marked deficiency of CG 
sequences and the presence of CG 
islands in vertebrate genomes, A 
black line marks the location of an 
unmethylated CG dinudeotide in the 
DNA sequence, while a red line marks 
the location of a methylated CG 
dinudeotide. 
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Summary 

Tke many types of cells in animals and plants are created largely through mecha- 
nisms that cause different genes to be transcribed in different cells. Since many spe- 
dalized animal cells can maintain their unique character when grown in culture, the 
gme regulatory mechanisms involved in creating them must be stable once estab- 
Ushed and heritable when the cell divides, endoiving the cell with a memory of its 
developmental history, Procaryotes and yeasts provide unusually accessible model 
sy&tms in which to study gene regulatory mechanisms, some of which may he rel- 
evant to the creation of specialized cell types in Mghereucaryotes. One such mecha- 
nism involves a competitive interaction between two (or more) gene regulatory pro- 
teins, each of which inhibits the synthesis of the other, this can create a flip-flop 
smtdi that switches a cell between two alternative patterns of gene expression, Di- 
rea or indirect positive feedback loops, which enable gene regulatory proteins to 
perpetuate their own synthesis, provide a general mechanism for cell memory. 

In eucaryotesgene transcription is generally controlled by combinations of gene 
regulatory proteins. It is thought that each type ofceU in a higher eucaryotic organism 
contains a specific combination of gene regulatory proteins that ensures the expr^- 
5w?n of only those genes appropriate to that type of cell A given gene regulatory pro- 
tein may be expressed in a variety of circumstances and typically is involved in the 
regulation of many genes. 

In addition to diffusible gene rotatory proteins, inherited states of chromatin 
condensation are also utilized by eucaryotic cells to regulate gene expression. In ver- 
i^frates DNA methyiation also plays a part, mainly as a device to reinforce decisions 
obom gene expression that are made initially by other mechanisms. 



Posttranscriptional Controls 

Although controls on the initiation of gene transcription are the predominant 
wnn of regulation for most genes, other controls can act later in the pathway 
D'om RNA to protein to modulate the amount of gene product that is made. Al- 
H^ough these posttranscriptionai controls, which operate after RNA polymerase 
as bound to the gene's promoter and begun RNA synthesis, are less common 
^transcriptional control, fox many genes they are crucial. It seems that every 
in gene expression that couId.be controlled in principle is likely to be regu- 
ated under some circumstances for some genes. 

We consider the varieties of posttranscriptionai regulation in temporal or- 
according to the sequence of events that might be experienced by an RNA 
^^ecule after its transcription has begun (Figure 9-72). 
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Figure 9-72 Possible post- 
transcriptionai controls on gene 
expression. Only a few of these 
controls are likely to be used for any 
one gene. 
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amount of protein A in the cell to be 
much greater than that of protein B. 
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FROM DNATO RNA 

Itanscription and translation are the means by which cells read out, or express, 
the genetic instructions in their genes. Because many identical RNA copies can 
be made from the same gene, and each RNA molecule can direct the synthesis 
of many identical protein molecules, cells can synthesize a large amount of 
protein rapidly when necessary. But each gene can also be transcribed and 
translated with a different efficiency allowing the cell to make vast quantities of 
some proteins and tiny quantities of others (Figure 6-3). Moreover, as we see in 
the next chapter, a cell can change (or regulate) the expression of each of its 
genes according to the needs of the moment— most obviously by controlling 
the production of its RNA. 

Portions of DNA Sequence Are Transcribed into RNA 

The first step a cell takes in reading out a needed part of its genetic instructions 
is to copy a particular portion of its DNA nucleotide sequence — a gene— into an 
RNA nucleotide sequence. The information in RNA, although copied into another 
chemical form, is still written in essentially the same language as it is in DNA — 
the language of a nucleotide sequence. Hence the name transcription. 

Like DNA, RNA is a linear polymer made of four different types of nucleotide 
subunits linked together by phosphodiester bonds (Figure 6^). It differs from 
DNA chemically in two^ respects: (1) the nucleotides in RNA are 
ribonucleotides— that is, they contain the sugar ribose (hence the name ribonu- 
cleic add) rather than deojcyribose; (2) although, like DNA, RNA contains die 
bases adenine (A), guanine (G), and cytosine (C), it contains the base uracil (U) 
instead of the thymine (T) in DNA. Since U, like T, can base-pan: by hydrogen- 
bonding with A (Figure 6-5), the complementary base-pairing properties 
described for DNA in Chapters 4 and 5 apply also to RNA (in RNA, G pairs with 
C, and A pairs with U). It is not uncommon, however, to find other types of base 
pairs in RNA: for example, G pairing with U occasionally 

Despite these small chemical differences, DNA and RNA differ quite dra- 
matically in overall structure. Whereas DNA always occurs in cells as a double- 
stranded helix, RNA is single-stranded. RNA chains therefore fold up into a 
variety of shapes, just as a polypeptide chain folds up to form the final shape of 
a protein (Figure 6-6) . As we see later in this chapter, the abOity to fold into com- 
plex ttiree-dimensional shapes allows some RNA molecules to have structural 
and catalytic functions. 




Transcription Produces RNA Comptementar/ to 
One Strand of DNA 

All of the RNA in a cell is made by DNA transcription, a process that has cer- 
tain similarities to the process of DNA replication discussed in Chapter 5. 
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Figure 6-89 Protein aggregates tliat cause human disease. (A) Schematic illustration of the type of 
conformational change in a protein that produces material for a cross-beta filament. (B) Diagram illustrating 
the self-infectious nature of the protein aggregation that is central to prion diseases. PrP Is highly unusual 
because the misfolded version of the protein, called PrP*, induces the nbrmal PrP protein it contacts- to 
change its conforniation, as shown. Most of the human diseases caused by protein aggregation are caused by 
the overproduction of a variant protein that is especially prone to aggregation, but because this structure is 
not infectious in this way»it cannot spread from one animal to another. (C) Drawing of a cross-beta fllannent, 
a common type of protease-resistant protein aggregate found in a variety of human neurological diseases. 
Because the hydrogen-bond interactions in a P sheet form between polypeptide bacld)one atonu (see Figure 
3-9), a number of different abnormally folded proteins can produce ^is structure. (O) One of several 
possible models for the conversion of PrP to PrP*» shovdng the likely ciiange of two (x-helices into four 
^sQTmds. Although the structure of the normal protein has been determined accurately, the structure of the 
infectious form Is not yet Icnovm with certainty because the aggregation has prevented the use of standard 
structural techniques. (C, courtesy of Louise Serpeli. adapted from M. Sunde et al.,/ Mof. Bfoi 273:729-739, 
1 997; D, adapted from S.B. Prusiner. Trends fltochem. Sd. 2 i :482-487, 1 996.) 

animals and humans. It can be dangerous to eat the tissues of animals that con- 
tain PrP*, as vTitnessed most recently by the spread of BSB (conunonly referred 
to as the "mad cow disease") from cattle to humans in Great Britain. 

Fortunately, in the absence of PrP*, PrP is extraordinarily difficult to convert 
to its abnormd form. Although very few proteins have the potential to misfold 
Into an infectious confonnation, a similar transfoimation has been discovered 
to be the cause of an otherwise mysterious 'protein-only inheritance" observed 
in yeast cells. 

There Are Many Steps From DNA to Protein 

We have seen so far in this chapter that many different types of chemical reac- 
tions are required to produce a properly folded protein from the information 
contained in a gene (Figure 6-90). The final level of a properly folded protein in 
a cell therefore depends upon the efficiency with vdiich each of the many steps 
is performed. 

We discuss in Chapter 7 that cells have the ability to change the levels of 
their proteins according to their needs. In principle, any or all of the steps in Fig- 
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Figure 6-90 The production of a 
protein by a eucaryotic cell. The final 
level of each protein in a eucaryotic cell 
depends upon the efficiency of each step 
deplaed. 
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ure &~90) could be regulated by the cell for each individual protein. However, as 
we shall see in Chapter 7, the initiation of transcription is the most common 
point for a cell to regulate the expression of each of its genes. This makes sense, 
inasmuch as the most efficient way to keep a gene from being expressed is to 
block the very first step-^e transaiption of its DNA sequence into an RNA 
molecule. 



Summary 

The translation of the nucleotide sequence of an mRNA molecule into protein takes 
place in the cytoplasm on a large Hbonucleoprotein assembly called a ribosome. The 
amino adds used for protein synthesis are first attached to a family of tRNA 
molecules, each of which recognizes, by complementary base-pair interactions, par- 
ticular sets of three nucleotides in themRNA (codons). Tkesequence ofnucleotides in 
the mRNA is then readftom one end to the other in sets of three according to the 
genetic code. 

To initiate translation^ a small ribosomal subunit binds to the mRNA molecule 
at a start codon (AUG) that is recognized by a unique initiator tRNA molecule* A 
large ribosomal subunit binds to complete the ribosome and begin the elongation 
phase of protein synthesis. During this phase, aminoacyl tRNAS'—eadt bearing a 
specific amino acid bind sequentially to the appropriate codon in mRNA by forming 
complementary base pairs with the tRNA anticodon. Each amino acid is added to the 
C'terminal end of the growing pofypeptide by means of a cycle of three sequential 
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Figure 7-5 Six steps at which 
eucaryotic gene expression can be 
controlled. Controls that operate at 
steps I through S are discussed ip this 
chapter. Step 6, the regulation of protein 
activity, includes reversible activation or 
Inactlvation by protein phosphorylation 
(discussed In Chapter 3) as well as 
irreversible inactivation by proteolytic 
degradation (discussed in Chapter 6). 



Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNA to Protein 

If differences among the various cell types of an organism depend on the partic- 
ular genes that the cells express, at what level is the control of gene e3q)ression 
exercised? As we saw in the last chapter, there are many steps in the pathway 
leading from DNA to protein, and all of them can in principle be regulated. Thus 
a cell can control the proteins it makes by (1) controlling v\^en and how often a 
given gene is transcribed (traiiscriptionai control], (2) controlling how the RNA 
transcript is spliced or otherwise processed (MNA processing control). (3) 
selecting which completed mRNAs in the cell nudeus are exported to the cytosol 
and determining where in the cytosol they are localized (RNA transport and 
localization control), (4) selecting which mRNAs in the cytoplasm are translated 
by ribosomes (translational control), (5) selectively destabilizing certain mENA 
molecules in the cytoplasm (mRNA degradation control), or (6) selectively acti- 
vating, inactivating, degrading, or con^)artmentalizing specific protein 
molecules after they have been made (protein activity control) (Figure 7-5) . 

For most genes transcriptional controls are paramount. This makes sense 
becat^e, of all the possible control points illustrated in Figure 7-5, only tran- 
scriptional control ensures that the cell will not synthesize superfluous interme- 
diates. In the following sections we discuss the DNA and protein compon«its 
that perform this function by regulating the initiation of gene transcription. We 
shall xetum at the end of the chapter to the additional ways of regulating gene 
expression* 

Summary 

Thegpnome ofa ceU contains in its DHA sequence the information to make many 
thousands of different protein and RNA molecules. A ceU typically expires onfy a 
fraction of its genes, and the different types ofceUs in muiticeUidar organisms arise 
because different sets ofgmes are expressed. Moreover, ceOs can change the pattern 
of genes they express in response to changes in dieir environment, such as signals 
from other cells. Although all of the steps involved in expressing a gene can in prin- 
ciple be regulated, for most genes the initiaUon of UNA transcription is the most 
. important point of control 



DNA-BINDING MOTIFS IN GENE REGULATORY 
PROTEINS 

How does a cell determine which of its thousands of genes to transcribe? As 
mentioned briefly in Chapters 4 and 6, the transcription of each gene is con- 
polled by a regulatory region of DNA relatively near the site where transcription 
begins. Some regulatory regions are simple and act as switches that are thrown 
oy a single signal. Many others are complex and act as tiny microprocessors, 
re^onding to a variety of signals that they interpret and integrate to switch the 
" '^^ghboring gene on or off. Whether complex or simple, these switching devices 
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occur in the germ line, the cell lineage that gives rise to sperm or eggs. Most of 
th2 DNA in vertebrate germ cells is inactive and highly methylated. Over long 
periods of evolutionary time/ the methylated CG sequences in these inactive . 
regions have presiunably been lost through spontaneous deamination events 
that were not properly repaired However promoters of genes that remain active 
in the germ cell lineages (including most housekeeping genes) are kept 
unmeth)4ated, and therefore spontaneous deaminations of Cs that occiir with- 
in them can be accurately repaired. Such regions are preserved in modem day 
vertebrate cells as CG islands. In addition, any mutation of a CG sequence in the 
genome that destroyed the function or regulation of a gene in the adult would be 
selected against, and some CG islands are simply the result of a higher than nor- 
mal density of critical CG sequences. 

The manunalian genome contauis an estimated 20,000 CG islands. Most of 
the islands mark the 5' ends of transcription units and thus, presumably, of 
genes. The presence of CG islands often provides a convenient way of identify- 
ing genes in the DNA sequences of vertebrate genomes. 

Summary 

The many types of cells in animals and plants are created largely through mecha- 
nisms that cause different genes to be transcribed in different cells. Since many 
specialized animal cells can maintain their unique character through many cell 
division cycles and even when grown in culture, the gene ri^latory mechanisms 
invoh/ed in creating th&n must be stable once established and heritable when the 
cell divides. These features eruiow the cell with a memory of its developmental history. 
Bacteria and yeasts provide unusually accessible model systems in which to study 
gene regulatory mechanisms. One such mechanism involves a competitive interac- 
tion between two gene regulatory proteins, each of which inhibits the synthesis of the 
other, this can create aflip-Jlpp switch that switches a ceU between two alternative 
patterns of gene expression. Direct or iruiirect positive feedback loops, which enable 
gene regulatory proteins to perpetuate their own synthesis, provide a general mech- 
anism for cell memory. Negative feedback loops with programmed delays form the 
basis far cellular clodcs. 

Ineucaryotes the transcription ofa gene is genemlly controlled by combinations 
of gene regulatory proteins. It is thought that each type of cell in a higher eucaryotic 
organism contains a specific combination of gene regulatory proteins that ensures 
the expression of only those genes appropriate to that type of cell A given gene regu- 
latory protein may be active in a variety of circumstances and typically is involved 
in the reguloHon of many genes. 

In addition to diffusible gene regulatory proteins, inherited states of chromatin 
condensation are also used by eucaryotic cells to regulate gene expression. An espe- 
ciatiy dramatic case is the inactivation of an entire X chromosome infenude mam- 
mals. In vertebrates DNA methylation also functions in gene regulation, bdng used 
mainly as a device to rdnfarce decisions about gene expression that are made ini- 
tially by other mechanisms. DNA methylation also underlies the phenomerwn of 
genomic imprinting in mammals, in which the expression ofa gene depends on 
wheOier it was inherited from the mother or Uie fasten 
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Figure 7-^6 A mechanism to explain 
both the marked overall deficiency 
of CG sequences and thdr clustering 
into CG islands in .vertebrate 
genomes. A black line marks the bcatlon 
of a CG dinucleotide !n the DNA 
sequence* while a ne</ "lollipop" Indicates 
the presence of a methyl group on the 
CG dinucleotide. CG sequences that lie ?n 
regulatory sequences of genes that are 
transcribed In germ cells are unmethylated 
and therefore tend to be retained In 
evolution. Meth/lated CG sequences, on 
the ofher hand, tend to be lost through 
deamination of 5-methyl C toT, untess the 
CG sequence is critical for survival. 



POSTTRANSCRIPTIONAL CONTROLS 

In principle, every step required for the process of gene expression could be 
controlled. Indeed, one can find examples of each type of regulation, although 
any one gene is likely to use only a few of them. Controls on the initiation of 
gene transcription are the predominant form of regulation for most genes. But 
other controls can act later in the pathway from DNA to protein to modulate 
the amount of gene product that is made. Although these posttranscriptional 
controls, vAnch operate after RNA polymerase has bound to the gene's promoter 
^d begun RNA synthesis, are less common than transcriptional control^ for 
many genes they are crucial. 
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CHAPTER 29 
♦Ke^fHSlKflWf transcription 



tli^ pli^notypic dilTi'ffncr^s \\m <ii5\mguish the 
^.jirious ktmis of ivlls in a hijshrr vukaryute are 
fnrgriy dii^ to' illirerernres in thif expression at ' 
^rnes thni code ftir pniieins. iliai is. Iho.se Iran- 
5Pril>^cl hy RNA (Mtlyinenise II. In prineipte. ilie 
f ^pression of Ihese {sent** ox't^lW lie lY^utated ai 
any one of several .sl«i$es. Tl>e concept of ihe 
'level of conlroP implies ihni gene expression 
i5 pol necessarily an auumiatic process once il 
has be^n. li could regulated in' a gene- 
specific u-ar ai any one of several sequential 
slep$- ^^c can dlMin^iitsti (al lenM) five poten- 
liat control points, rnrmln^ Itie series: 

Artivalion of ^tvw sirneuuv 
i 

InttiAltnn of transrriplinn 
i 

Frotvsjiing the ii';ui.<«Tipj 

ti9ti$por^ to c>'t»pl«sni 
i 

TrauMtilitm of niRNA 

TIte exi5leiire of ttte first is im]>lied liy 
Ihe discovery Ihul ^enes niny exisi in either of 
hvii stniclural condiiioiw. ttelnilve lo (he stale 
(ir most of llie genome, (^enes »re round in 
iin "arlive* Male in the cells in wnfdt they 
anf expressed (see Cliapier iTj. Tli€ ct\ange ct 
siriicture is distinct firtnn ilie aci of iranscrip- 
lion, and indicates <i>ai Ihe gene is transcribe 
able/ This suggests thai acquisilion or llie 
"active* structure mtisl be ihe first step in gene 
expression. 

Transcription of a gene in the active stale is 



controlled at thr stage of initiation, ihai Is. l>y 
the Inievaction of RNA Molynierase \vtth its pni- 
nioter- lliis Is now becoming susceptible to 
analysis In the iVr vitro s>^lejns (see Chapter 
For most genes, tills is a nialnr conirni 
pohii: proiiably it is Ihe most c«nnii)on level of 
regnlallnn. 

There is at present no evidence for controi 
at subse(|uent stages of Iranscripiion in etikary- 
olic cclls» for example \ia anlilerniinatton 
mechanisnis. 

The primary transcript is modified by capping 
at itie 5' end« and nsnally also t)y polyadenyia- 
lion at the V end. tmrons must l)e spliced out 
fruin the transcripts of interrupted genes. The 
molure RNA must tic espnited from the nucleus 
lo Hie crtoplnsni. tieiwlation «>r gene expression 
hy selection of se<|ueiice5 at the irvet of nuclear 
RNa nit<:hi involve any tix all of ilicse slages. 
hut the unc fur niiicii n-e have most evidence 
concerns changes in splicing: some genes are 
e.\prr5sed l>> means of ultmuuive splicing pul- 
tems vvhtise ivgulation coiUroIs Ihe type of pro> 
lein product (see Cliapier M}). 

Finally. Ihc irnaslation of an mRNA In the c>to- 
plasm can Ue speciHcaMy controlled. Tltere is little 
evidence for tlie emplo)*meiil of this inedtanism in 
adull somalic cells. hu| it dnes occur in some 
enihr>'nnic siluaiions, as desei'lf>e(l in Chapter 
-The mechaiitsm is presun\ed to involve the hlock- 
tng of intilsilon of translaiiqn of some mRNAs by 
specific protein factors. 

But having acknowledged that control of gene 
expression can occur at m'uiUpLe stages, and 
that production of RNA cannot ineWtably be 
equaled v^ith production of protein, it Is dear 
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that the overwhelming majority of regu)atoi7 
. events occur at Ihe intdalion of iranscriptton. 
Regulation uf tlssue-specinc gene iranscription 
lies at Khe heart of eukaryotic dilTerentialion; 
indeed, we see examples in Chapler 3S in 
which proteins lliat regulaie embr>'onic devel- 
opment prove to be Iraitscriplion factors. A reg- 
ulatoo' transcripiion factor serves (o provide 



confimon control of a large number of largei 
genes, and we seek lo answer two questions 
about this mode of regulation: what identtftes 
Ihe common target genes to the iranscripiion 
factor; and how is the activity of the irnnscrip- 
tion factor itself regulated in response to imrin- 
sic or extrinsic signals? 



Response elements idenlii'y genes under common 
regulation 



The principle that emerges from dianfcterizing 
groups of genes under common tronirol is that 
they share a promoter eletnenl that is rtcogniz^d 
by o ngnlatory transaiption factor An element 
that causes a gene to respond to such a Factor 
is called a response element examples are the 
HSE (heat shoch response element), GRE 
(glueocorticoid response element^ 5RE (scrum 
response elemeiU}. 

The properties of some inducible transcription 
factors and the elements that they recognize are 
summarized in. Table 29,1, Kesponse elements 
have the same general characterisiics as 
upstream elements of promoters or enliancers. 
They contain short consensus sequences, and 
copies of the response elements found in dif- 
fercnl genes are closely related, but not neces- 
sarily identical. The region bound by the factor 
exiends for- a short distance on either side of 



Table 23.1 (ndi^citjJe lranscr;pvor; faclcfs bind ic 
response elumGrls lhal idonl.fy Qfctps cf prcmclcrs 
or cnhuncsrs SLt;5Ct Jo coc;dina;G ccrlrol. 



Regtilfltory Agervl Module Consensus Fador 

Heaai»»C* HSC CNNGAANMTCCNNQ HSTF 

Glueooof1ico» GRE TGQTACMATGTrCT Bfceplor 

PnwbolBStW TRE TGACTCA API 

Seram SRE CCATATTAGG SW 



the consensus sequence. In promoters, the elr- 
meats are not preseiit at fixed distances ^rt^m 
the siartpoint. but are usually <200 bp upsOrtam 
Of It. The presence of a single element usuailf 
Is siiillclem to confer the regulatory response* 
but sometimes there arc multiple copies. 

Kesponse elements may be located in 
moters or in enhancers. Some types of elemcnis 
are typically found in one rather than the otbrr. 
usually an HSE is found fin a promoter* whii^ * 
G?& is found in an enhancer. We assume th»{ 
all response elements function by the >a*^*' 
general princl|)le. A gfne is regulated 6/ ^ 
sequence ai ttie promoter or enhancer (hai 
recognized by a specific protein. The p'*'^^" 
/unctions as a transcription factor needed 
HI^'A pol\tnenac to initiate' Active protriJf ^ 
oiMiiatfie only under condition token the, ^ 
fo be ccprtssed; its absence means that ttif ^ 
mater- is not activated by this particular ^'^'^^ 
.An example of a situation in which "'■''^^^ 
genes are controlled by a single factor is 
\1ded by the heat shock response. This ^^^^ 
mpn to a wide range of prokoryo*^* 
eukarj'oles and Involves multiple <^"'^*j,rr 
gene expression; an increase in temp*^ 
turns off transcription of some genes. ^^^^^^^ 
transcription of Ihe heat shock B^^^'^^^- 
causes changes in the translation 
The control of the heat shock genes 
the differences between prokaiyotic ^^^^ 
eukaryoiic modes of control In. bacieriat ^ 
Sigma factor is synthesized lhal ^^^^^ ^^tf 
polymerase holoenzyme lo recognize f^^- 
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Abstract 

Background: Prosute stem cell antigen (PSCA) is a recently defined homob^e of the Thy-l/Ly-6 family of 
gtycosylphosphatidylinositol (GPI)-anchored ceil surface antigens. The purpose of the present study was to 
examine the expression status of PSCA protein and mRNA in clinical specimens of human prostate cancer (Pea) 
and to validate it as a potential molecular target for diagnosis and treatment of Pea, 

Materials and Methods: Immunohistochemical (IHC) and in situ hybridization (ISH) analyses of PSCA 
expression were simultaneously performed on paraffin-embedded sections from 20 benign prostatic hyperplasia 
(BPH). 20 prostatic intraepithelial neoplasm (PIIM) and 48 prosute cancer (Pea) tissues, including 9 androgen- 
Independent prostate caneers. The level of PSCA expression was semiquantitathreiy scored by assessing both the 
percentage and Intensity of PSCA-positive staining cells in the specimens. Then compared PSCA expression 
between BPH» PIN and Pea tissues and analysed the correlations of PSCA expression level widi pathological grade, 
clinical stage and progression to androgen-independence in Pea. 

Results: In BPH and low grade PIN, PSCA protein and mRNA staining were weal< or negative and less intense 
and uniform than that seen in HGPIN and Pea. There were moderate to strong PSCA protein and mRNA 
expression in 8 of 1 1 (72.7%) HGPIN and in 40 of 48 (83.4%) Pea specimens examined by IHC and ISH analyses, 
with statistical significance compared with BPH (20%) and low grade PIN (22.2%) samples (p < 0,05. respectively). 
The expression level of PSCA increased with high Gleason grade, advanced stage and progression to androgen- 
independence (p < O.OS. respectively). In addition. IHC and ISH staining showed a high degree of correlation 
between PSCA protein and mRNA overexpression. 

Conclusions: Our data demonstrate that PSCA as a new cell surface marker is overexpressed by a majority of 
human Pea. PSCA expression correlates positively with adverse tumor characteristics, such as increasing 
pathological grade (poor cell differentiation), worsening clinical stage and androgen-independence, and 
speculatively with prosute carcinogenesis. PSCA protein overexpression results from upregulated transcription 
of PSCA mRNA. PSCA may have prognostic utility and may be a promising molecular target for diagnosis and 
treatment of Pea. 
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Introduction 

Prostate cancer (Pea) is the second leading cause of can- 
cer-related death in American men and is becoming a 
common cancer increasing in China. Despite recently 
great progress in the diagnosis and management of local- 
ized disease, there continues to be a need for new diagnos- 
tic markers thai can accurately discriminate between 
indolent and aggressive variants of Pea. There also contin- 
ues to be a need for the identification and characterization 
of potential new therapeutic targets on Pea cells. Current 
diagnostic and therapeutic modalities for recurrent and 
metastatic Pea have been limited by a lade of specific tar- 
get antigens of Pea. 

Although a number of prostate-specific genes have been 
identified (i.e. prostate specific antigen, prostatic acid 
phosphatase* glandular kallikrein 2], the majority of these 
are seaeted proteins not ideally suited for many immuno- 
logical strategies. So, the identification of new cell surface 
antigens is critical to the development of new diagnostic 
and therapeutic approaches to the management of Pea. 

Reiter RE et al [1] reported the identification of prostate 
stem cell antigen (PSCA), a cell surface antigen that is pre- 
dominantly prostate specific. The PSCA gene encodes a 
123 amino acid glycoprotein, with 30% homology to 
stem ceil antigen 2 (Sea 2). Like Sca-2, PSCA also belongs 
to a member of lheThy-l/Ly-6 family and is anchored by 
a glycosylphosphatidylinositol (GPI) linkage. mRNA in 
situ hybridization (ISH) localized PSCA expression in nor- 
mal prostate to the basal cell epithelium, the putative 
stem cell compartment of prostatic epithelium, suggesting 
that PSCA may be a marker of prostate stem/progenitor 
cells. 

In order to examine the status of PSCA protein and mRNA 
expression in human Pea and validate it as a potential 
diagnostic and therapeutic target for Pea, we used immu- 
nohistochemisuy (IHC) and in situ hybridization (ISH) 
simultaneously, and conducted PSCA protein and mRNA 
expression analyses in paraffin-embedded tissue speci- 
mens of benign prostatic hyperplasia (BPH, n « 20), pros- 
tate intraepithelial neoplasm (PIN, n = 20) and prostate 
cancer (Pea, n = 48). Furtheimoie, we evaluated the possi- 
ble correlation of PSCA expression level with Pea tumori- 
genesis, grade, stage and progression to androgen- 
independence. 

Materials and methods 

Tissue samples 

All of the clinical tissue specimens studied herein were 
obtained from 80 patients of 57-84 years old by prostate- 
ctomy, transurethral resection of prostate (TURP) or biop- 
sies. The patients were classified as 20 eases of BPH, 20 
cases of PIN, 40 cases of primary Pea, including 9 patients 
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with recurrent Pea and a history of androgen ablation 
therapy (orchiectomy and/or hormonal therapy), who 
were referred to as androgen-independent prostate can- 
cers. Eight specimens were harvested from these andro- 
gen-independent Pea patients prior to androgen ablation 
treatment. Each tissue sample was cut into two parts, one 
was fbced in 10% formalin for IHC and the other treated 
with 4% paraformaldehyde/0.1 M PBS PH 7.4 in 0.1% 
DEPC for I h for ISH analysis, and then embedded in par- 
affin. All paraffm blocks examined were then cut into 5 
^m sections and mounted on the glass slides specific for 
IHC and ISH respeaivcly in the usual fashion. H&E- 
stained section of each Pea was evaluated and assigned a 
Gleason score by the experienced urological pathologist at 
our institution based on the criteria of Gleason score |2). 
The Gleason sums are summarized in Table 1. Clinical 
staging was performed according to Jewett-whitmore- 
prout staging system, as shown in Table 2. In the category 
of PIN, we graded the specimens into two groups, i.e. low 
grade PIN (grade I - H) and high grade PIN (HGPIN. 
grade HI) on the basis of literatures |3,4]. 

/mmunohistochemicol (tHC) analysis 
Briefly, tissue sections were deparaffinized, dehydrated, 
and subjected to mierowaving in 10 mmol/L citrate 
buffer, PH 6.0 (Boshide, Wuhan, China) in a 900 W oven 
for 5 min to induce epitope retrieval. Slides were allowed 
to cool at room temperature for 30 min. A primary mouse 
antibody specific to human PSCA (Boshide, Wuhan, 
China) with a 1: 100 dilution was applied to incubate with 
the slides at room temperature for 2 h. Labeling was 
detected by sequentially adding biotinylated secondary 
antibodies and strepavidin-peroxidase, and localized 
using 3,3'-diaminobenzidine reaction. Sections were then 
counterstained with hematoxylin. Substitution of the pri- 
mary antibody with phosphate-buffered-saline (PBS) 
served as a negative-staining control. 

mRNA in situ hybridiiatlon (ISH) 
Five-^m-thick tissue seaions were deparaffinized and 
dehydrated, then digested in pepsin solution (4 mg/ml in 
3% ciuic acid) for 20 min at 37,5 ''C, and further proc- 
essed for ISH. Digoxigenin-labeled sense and antisense 
human PSCA RNA probes (obtained from Boshide, 
Wuhan, China) were hybridized to the sections at 48*C 
overnight. The posthybridization wash with a high strin- 
gency was performed sequentially at 37 * C in 2 « standard 
saline citrate (SSC) for 10 min, in 0.5 x SSC for 15 min 
and in 0.2 k SSC for 30 min. The slides were then incu- 
bated to biotinylated mouse anti-digoxigenin antibody at 
37.5*C for 1 h followed by washing in 1 » PBS for 20 min 
at room temperature, and then to strepavidin-peroxidase 
at 37.5 '^C for 20 min followed by washing in 1 x PBS for 
15 min at room temperature. Subsequently, the slides 
were developed with diaminobenzidine and then coun- 
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Intensity x frequency 



Gleason score 0-6 (%) 9 {%) 



2-4 5(83) 1(17) 

5-7 19(79) 5(21) 

WO 5(28) 13(72) 



Table Z: Correlation of PSCA expression with clinical stage 







Intensity ^ frequency 




Tumor stage 


0-6 (%) 




9(%) 




27 (67.5) 
2(25) 




13(32.5) 
6(75) 





terstained with hematoxylin to localize the hybridization 
signals. Sections hybridized with the sense control probes 
routinely did not show any specific hybridization signal 
above background. All slides were hybridized with PBS to 
substitute for the probes as a negative control. 

Scoring methoiis 

To determine the correlation between the results of PSCA. 
immunostaining and mRNA in situ hybridization, the 
same scoring manners are taken in the present study for 
PSCA protein staining by IHC and PSCA mRNA staining 
by ISH. Each slide was read and scored by two independ- 
endy experienced urclogical pathologists using Olympus 
BX-41 light miaoscopes. The evaluation was done in a 
blinded fashion. For each seaion, five areas of similar 
grade were analyzed semiquantitaiively for the fraction of 
cells staining. Fifty percent of specimens were randomly 
chosen and rescored to determine the degree of interob- 
server and intraobserver concordance. There was greater 
than 95% intra- and interobserver agreement. 

The intensity of PSCA expression evaluated microscopi- 
cally was graded on a scale of 0 to 3+ with 3 being the 
highest expression observed (0, no staining; 1+, mildly 
intense; 2+, moderately intense; 3+, severely intense). The 
staining density was quantified as the percentage of cells 
staining positive for PSCA with the primary antibody or 
hybridization probe, as follows: 0 = no staining; 1 posi- 
tive staining in <25% of the sample; 2 « positive staining 
in 25%-50% of the sample; 3 => positive staining in >50% 



of the sample. Intensity score (0 to 3+) was multiplied by 
the density score (0-3) to give an overall score of 0-9 
[1,5]. In this way, we were able to differentiate specimens 
that may have had focal areas of increased staining from 
those that had diffuse areas of increased staining [6]. The 
overall score for each specimen was then categorically 
assigned to one of the following groups: 0 score, negative 
expression; 1-2 scores, weak expression; 3-6 scores, mod- 
erate expression; 9 score, strong expression. 

Stotfsticof anatysls 

Intensity and density of PSCA protein and mRNA expres- 
sion In BPH, PIN and Pea tissues were compared using the 
Chi-square and Student's c-test. Univariate associations 
between PSCA expression and Gleason score, clinical 
stage and progression to androgen- independence were 
calculated using Fisher's Exact Test. For all analyses, p < 
0.05 was considered statistically significant. 

Results 

PSCA expression In BPH 

In general, PSCA protein and mRNA were expressed 
weakly in individual samples of BPH. Some areas of 
prostate expressed weak levels (composite score 1-2), 
whereas other areas were completely negative (composite 
score 0). Four cases (20%) of BPH had moderate expres- 
sion of PSCA protein and mRNA (composite score 4-6) 
by IHC and ISH. In 2/20 (10%) BPH specimens, PSCA 
mRNA expression was moderate (composite score 3-6), 
but PSCA protein expression was weak (composite score 
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2) in one and negadvc (composite score 0) in the other. 
PSCA expression was localized to the basal and secretory 
epithelial cells, and prostatic stroma was almost negative 
staining for PSCA protein and mRNA in all cases 
examined. 

PSCA expresdon in PIN 

In this study, we detected weak or negative expression of 
PSCA protein and mRNA (^2 scores) in 7 of 9 (77.8%) 
low grade PIN and in 2 of 1 1 (18.2%) HGPIN, and mod- 
erate expression (3-6 scores) in the rest 2 low grade PIN 
and 5 of 11 (45.5%) HGPIN. One HGPIN with moderate 
PSCA mRNA expression (6 score) was found weak stain- 
ing for PSCA protein (2 score) by IHC. Strong PSCA pro- 
tein and mRNA expression (9 score) were detected in the 
remaining 3 of 11 (27.3%) HGPIN. There was a statisd- 
cally significant difference of PSCA protein and mRNA 
expression levels observed between HGPIN and BPH (p < 
0.05), but no statistical difference reached between low 
grade PIN and BPH (p > 0.05). 

PSCA expression in Pea 

In order 10 determine if PSCA protein and mRNA can be 
detected in prostate cancers and if PSCA expression levels 
are inaeased in malignant compared with benign glands. 
Forty-eight paraffin-embedded Pea specimens were ana- 
lysed by IHC and ISH. It was shown that i 9 of 48 (39.6%) 
Pea samples stained very strongly for PSCA protein and 
mRNA with a score of 9 and another 21 (43.8%) speci- 
mens displayed moderate staining with scores of 4-6 (Fig- 
ure 1). In addition, 4 specimens with moderate to strong 
PSCA mRNA expression (scores of 4-9) had weak protein 
staining (a score of 2) by IHC analyses. Overall, Pea 
e3q>ressed a significandy higher level of PSCA protein and 
mRNA than any other specimen category in this study (p 
< 0.05, compared with BPH and PIN respectively). The 
result demonstrates that PSCA protein and mRNA are 
overexpressed by a majority of human Pea. 

Comlatlon of PSCA expression witft Gleason score in Pea 

Using the semi-quantitative scoring method as described 
in Materials and Methods, we compared the expression 
level of PSCA protein and mRNA widi Gleason grade of 
Pea, as shown in Table 1. Prostate adenocarcinomas were 
graded by Gleason score as 2-4 scores = well-differentia- 
tion, 5-7 scores « moderate-differentiation and 8-10 
scores « poor-differentiation [7]. Seventy-two percent of 
Gleason scores 8-10 prostate cancers had very strong 
staining of PSCA compared to 21% with Gleason scores 
5-7 and 17% with 2-4 respectively, demonstrating that 
poody differentiated Pea had significantly stronger 
expression of PSCA protein and mRNA than moderately 
and well differentiated tumors (p < 0.05). As depicted in 
Figure 1, IHC and ISH analyses showed that PSCA protein 
and mRNA expression in several cases of poorly differen- 



tiated Pea were particularly prominent, with more intense 
and uniform staining. The results indicate diat PSCA 
expression increases signifieanUy with higher tumor grade 

in human Pea. 

Correlation of PSCA expression with clinical stage in Pea 
With regards to PSCA expression in every stage of Pea, we 
showed the results in Table 2, Seventy-five percent of 
locally advanced and node positive cancers (i.e. C-D 
stages) expressed stadstieally high levels of PSCA versus 
32.5% that were organ confined (i.e. A-B stages) (p < 
0.05). The data demonstrate that PSCA expression 
inaeases significantly with advanced tumor stage in 
human Pea. 

Correlation of PSCA expression with androgen- 
independent progression of Pea 

All 9 specimens of androgen-independent prostate can- 
cers stained posiUve for PSCA protein and mRNA. Eight 
specimens were obtained from padents managed prior to 
androgen abladon therapy. Seven of eight (87.5%) of 
these androgen-independent prostate cancers were in the 
strongest staining category (score = 9), compared with 
three out of eight (37-5%) of paUents widi androgen- 
dependent cancers (p < 0.05). The results demonstrate 
that PSCA expression increases significandy widi progres- 
sion to androgen-independence of human Pea. 

It is evident from the results above that within a majority 
of human prostate cancers die level of PSCA protein and 
mRNA expression correlates significandy with increasing 
grade, worsening stage and progression to androgen-inde- 
pendence. 

Correlation of PSCA immunostainlng and mRNA in situ 
hyt)ridixation 

In all 88 specimens surveyed herein, we compared the 
results of PSCA IHC staining with mRNA ISH analysis. 
PosiUve staining areas and its intensity and density scores 
evaluated by IHC were identical to diose seen by ISH in 79 
of 88 (89.8%) specimens (18/20 BPH, 19/20 PIN and 42/ 
48 Pea respectively). Importandy, 27/27 samples with 
PSCA mRNA composite scores of 0-2, 32/36 samples 
with scores of 3-6 and 22/24 samples with a score of 9 
also had PSCA protein expression scores of 0-2, 3-6 and 
9 respectively. However, in 5 samples with PSCA mRNA 
overall scores of 3-6 and in 2 with scores of 9 ihere were 
less or negative PSCA protein expression (i.e. scores of 0- 
4), suggesting that this may reflect posttranseriptional 
modificadon of PSCA or that the epitopes recognized by 
PSCA mAb may be obscured in some cancers. The data 
demonstrate that die results of PSCA immunosuining 
were consistent with those of mRNA ISH analysis, show- 
ing a high degree of correlation between PSCA protein 
and mRNA expression. 
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Figure i 

Representatives of PSCA IHC and ISH staining in Pea (A. IHC staining, B. ISH staining. x200 magnification). A,, B,: negative con- 
trol of IHC and ISH. PBS replacing the primary antibody (A,) and hybridization with a sense PSCA probe (B,) showed no back- 
ground staining. Aj, Bj: a moderately differentiated Pea (Gleason score = 3+3 = 6) with moderate staining (composite score = 
6) in all malignant cells; A^: IHC shows not only cell surface but also apparent cytoplasmic staining of PSCA protein. Aj, B,: a 
poorly differentiated Pea (Gleason score = 4+4 = 8) with very strong staining (composite score = 9) In all malignant cells. 
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Discussion 

PSCA is homologous to a group of cell surface proteins 
that mark the earliest phase of hematopoietic develop- 
ment. PSCA mRNA expression is prostate-specific in nor- 
mal male tissues and is highly up-regulated in both 
androgen-dependem and-independent Pea xenografts 
(IAPC-4 tumors). We hypothesize that PSCA may play a 
role in Pea tumorigenesis and progression, and may serve 
as a target for Pea diagnosis and treatment. In this study, 
IHC and ISH showed that in general there were weak or 
absent PSCA protein and mRNA expression in BPH and 
lovv grade PIN tissues. However, PSCA protein and mRNA 
are widely expressed in HGPIN, the putative precursor of 
invasive Pea, suggesting that up-regulation of PSCA is an 
early event in prostate carcinogenesis. Recently, Reiter RE 
etal [1], using ISH analysis, reported that97 of 118 (82%) 
HGPIN specimens stained suongly positive for PSCA 
mRNA. A very similar finding was seen on mouse PSCA 
(mPSCA) expression in mouse HGPIN tissues by Tran C. 
P et al |8]. These data suggest that PSCA may be a new 
marker associated with transformation of prostate cells 
and tumorigenesis. 

We observed that PSCA protein and mRNA arc highly 
expressed in a large percentage of human prostate cancers, 
including advanced, poorly differentiated, androgen- 

independent and metastatic cases. Fluorescence-activated 
cell sorting and confocal/ immu no fluorescent studies 
demonstrated cell surface expression of PSCA protein In 
Pea cells (9|. Our IHC expression analysis of PSCA shows 
not only cell surface but also apparent cytoplasmic stain- 
ing of PSCA protein in Pea specimens (Figure 1). One pos- 
sible explanation for diis is that anti-PSCA antibody can 
recognize PSCA peptide precursors that reside in the cyto- 
plasm. Also, it is possible that the positive staining that 
appears in the cytoplasm is actually from the overlying 
cell membrane [S], These data seem to indicate diat PSCA 
is a novel cell sui^ce marker for human Pea. 

Our results show that elevated level of PSCA expression 
correlates with high grade (i.e. poor differentiation), 
increased tumor stage and progression to androgen-inde- 
pendenee of Pea. These findings support the original IHC 
analyses by Gu Z et al [9], who reported that PSCA protein 
expressed in 94% of primary Pea and the intensity of 
PSCA protein expression increased with tumor grade, 
stage and progression to androgen-independence. Our 
results also collaborate the recent work of Han KR et al 
[10], in which the significant association between high 
PSCA expression and adverse prognostic features such as 
high Gleason score, seminal vesicle invasion and capsular 
involvement in Pea was found. It is suggested that PSCA 
overexpression may be an adverse predictor for recur- 
rence, clinical progression or survival of Pea. Hara H et al 
(1 1] used RT-PCR detection of PSA, PSMA and PSCA in 1 



ml of peripheral blood to evaluate Pea patients with poor 
prognosis. The results showed that among 58 PCa 
patients, each PCR indicated the prognostic value in the 
hierarchy of PSCA>PS/V>PSMA RT-PCR, and extraprostatic 
eases with positive PSCA PCR indicated lower disease-pro- 
gression-free survival than those with negative PSCA PCR« 
demonstrating that PSCA can be used as a prognostic fac- 
tor. Dubey P et al |12] reported that elevated numbers of 
PSCA + cells correlate positively with the onset and devel- 
opment of prostate carcinoma over a long time span in 
the prostates of the TRAMP and PTEN +/- models com- 
pared with its normal prostates. Taken together with our 
present findings, in which PSCA is overexpressed from 
HGPIN to almost frank carcinoma, it is reasonable and 
possible to use increased PSCA expression level or 
increased numbers of PSCA-positive cells in the prostate 
samples as a prognostic marker to predict the potential 
onset of this cancer. These data raise the possibility that 
PSCA may have diagnostic utility or dinical prognostic 
value in human Pea. 

The cause of PSCA overexpression in Pea is not known. 
One possible mechanism is that it may result from PSCA 
gene amplification. In humans, PSCA is located on chro- 
mosome 8q24.2 (1], which is often amplified in meta- 
static and recunent Pea and considered to indicate a poor 
prognosis [IS-ISJ. Interestingly, PSCA is in close proxim- 
ity to the e-myc oncogene, which is amplified in >20% of 
recurrent and metastatic prostate cancers [16,17). Reiter 
RE et al 1 18| reported that PSCA and MYC gene copy num- 
bers were co-amplified in 25% of tumors (five out of 
twenty), demonstrating that PSCA overexpression is asso- 
ciated with PSCA and MYC coamplification in Pea. Gu Z 
et al (91 recently reporteted that in 102 specimens availa- 
ble to compare the results of PSCA tmmunosiaining with 
their previous mRNA ISH analysis, 92 (90.2%) had iden- 
tically positive areas of PSCA protein and mRNA e:q>res- 
sion. Taken together with our fmdings, in which we 
detected moderate to strong expression of PSCA protein 
and mRNA in 34 of 40 (85%) Pea specimens examined 
simultaneously by IHC and ISH analyses, it is demon- 
strated that PSCA protein and mRNA overexpressed in 
human Pea, and that the increased protein level of PSCA 
was resulted from the upregulated transcription of its 
mRNA. 

At present the regulation mechanisms of human PSCA 
expression and its biological function are yet to be eluci- 
dated. PSCA expression may be regulated by multiple fac- 
tors [18]. WatabeTet al (19) reported that transcriptional 
control is a major component regulating PSCA expression 
levels. In addition, induction of PSCA expression may be 
regulated or mediated through cell-cell contact and pro- 
tein kinase C (PKC) |20]. Homologues of PSCA have 
diverse activities, and have themselves been involved in 
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carcinogenesis. Signalling through SCA-2 has been dem- 
onstrated to prevent apoptosis in immature thymocytes 
[21). Thy-1 is involved in T cell activation and transducts 
signals through src-like tyrosine kinases [22]. Ly-6 genes 
have been implicated both in tumorigenesis and in cell- 
cell adhesion [23-25]. Cell-cell or cell-matrix interaction is 
critical for local tumor growth and spread to distal sites. 
From its restricted expression in basal cells of normal 
prostate and its homology to SCA-2, PSCA may play a role 
in stem/progenitor cell function, such as self-renewal (i.e. 
antt-apoptosis) and/or proliferation (1). Taken together 
with the results in the present study, we speculate that 
PSCA may play a role in tumorigenesis and clinical pro- 
gression of Pea through affeaing cell transformation and 
proliferation. From our results, it is also suggested that 
PSCA as a new cell surface antigen may have a number of 
potential uses in the diagnosis, therapy and clinical prog- 
nosis of human Pea. PSCA overexpression in prostate 
biopsies could be used to identify patients at high risk to 
develop recurrent or metastatic disease, and to discrimi- 
nate cancers from normal glands in prostatectomy sam- 
ples. Similarly, the deteaion of PSCA-overexpressing cells 
in bone marrow or peripheral blood may identify and pre- 
dia metastatic progression better than current assays, 
which identify only PSA-positive or PSMA-positive pros- 
tate cells. 

In summary, we have shown in this study that PSCA pro- 
tein and mRNA are maintained in expression from 
HGPIN through all stages of Pea in a majority of cases, 
which may be associated with prostate carcinogenesis and 
correlate positively with high tumor grade (poor cell dif- 
ferentiation), advanced stage and androgen-independent 
progression. PSCA protein overexpression is due to the 
upregulation of its mRNA transcription. The results sug- 
gest that PSCA may be a promising molecular marker for 
the clinical prognosis of human Pea and a valuable target 
for diagnosis and therapy of this tumor. 
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Abstract 

Tifwislaikm inltMon is reguialed In response to 
mitrient avaHabilfty and mftogenlc stimulation and Is 
ooiq>led with ceD cyde progression and cell growth. 
Several alterations In translational control occur in 
cancer. Variant mRNA sequences can alter the 
Iransiationd efficiency of IndMdual mRNA molecules, 
which m Han piqr a role In cancer biology. Changes in 
the expresdon or avallalillRy of components of Itio 
Iranslatiimai madilnery and In the activation of 
transtatlon through signal transduction pathways can 
lead to mom glol»a] changesi such as an Increase In 
the Gvedll rate of protein synthesis and translational 
activation of the mRNA molecules btvolved in cell 
9vwtii and procuration. We review the basic 
principles of translational contioi, the idterations 
encountered In cancer, and selected therapies 
targeting translation initiation to help elucidate new 
therapeutic avamues;* 

introduction 

The ibidam&ital principle of molecular therapeufics in can- 
cer is to exploit the differences In gene expression t)etween 
cancer cells and normal cells. With the advent of cDNA an«y 
technology, most efforts have concentrated on lden%ing 
diffenences In gene expression at the level of mRNA, which 
can be attributable either to DNA amplification or to differ- 
ences In transcription. Gene expression Is quite complicated, 
however, and is also regulated at the level of mRNA stability, 
mRNA tmnslation, and proton stability. 

The power of translational regulation has been recog- 
nized anvong devebpmental biologists, because transcription 
does not occur in earty embiyogenesls In eidotyotes. For ex- 
ample^ in Xm)pus, the period of transcr^tional quiescence 
continues until tiie embryo reaches mldbiastula transMm 
4000-ceii staga Therefore, all necessary mRNA molecules are 
transcribed during oogeneste and stockpiled In a translatfonally 
Inactive, masked form. The mRNA are translafionally activated 
at appropriate times during oocyte matursUon, fertilization, and 



eariy embryogenesis and tiius, are under strict translational 
control. 

Tneuislalion has an established role In cell growtiL Basi- 
cally, an Increase in protein syntiiesis occurs as a conso- 
quence of mitogenesls. Until recently, however, little was 
known about the att^ations in mRNA translation In cancer, 
and much Is yet to be discov^^ about their role in the 
development and progr^on of cancer. Here we review the 
basic principles of translational control, tho altenations en- 
countered In cancer, and setected ti)et«9)ies targeting transia^ 
tion Initlattdn to elucidate potential new tifterapeutic avenues. 

Basic Principle oflVanslational Control 
Mechanism of Tran^aUon /irltfaffon 

Translation Initiation Is the main step in IrandaltoRal regufaktcm* 
Translation Initiatbn fcs a complex pnoce^ in which tiie initiator 
and tire 40S and 60S ribosomal subunits are racmited to 
ti)e5' end of a mRNA nx>lecut6 and assembled by eukaryottc 
tianslatlon Irtitiaaon iKdore into an 80S riboso^ 
codon of tiie mrasiA (F^g. 1); The 5' end of eul<aryotic mnm 
capped, ie,, contains tiw cap structure m^GpppN (7HTie1hy»- 
guanoslne-triplKispho^'^ribonudeoslde). Most translation In 
eukaryotes ooours in a ca{>depemtentfeshlon. La, the cap is 
spedfically recognized by tt»eelF4E,*whidit)irKfe tiie 5' cap. 
The ell^ tnanslatk>n Initiation complex is tiien formed the 
assembly of elR& tiie RNA heffcase elF4A. and eiF4G. a 
scaRbldlng protein that mediates ttie binding of tiie 408 ribo- 
somal subunit to the mRNA nrxilecule through Interaction with 
the elF3 protein present on the 40S ribosome. e]F4A and elF4B 
partidpate ai metting the secondary structure ol tiie 5' UTR of 
the mmK The 438 miSatbn complex (40S/elF^et-ti=lNA/ 
GTP compte)0 scsns tiie mRNA in a 5'-^' direction until it 
encounters an AUG start oodon. This start oodon Is ttien tiase- 
patred to tiie antkxidon of lnt»ator ti^ Ibmr^ tiie ^ 
ation complex. The initiation factors are then displaced from the 
48S complex, and tiie 60S ribosome Joins to fonn tiie 80S 
ribosome. 

Unlike most eukaryotic translation, translation Inltladon of 
ceriain mFUviAs, such as the picomavims RNA, is cap inde- 
pendent and occurs t^ internal ritnsome entry. This mecha- 
nism does not raqidreelF4£ Btiier tiie 43S complex can bbid 
the Initiation codon dlrectiytinough Interaction witti the IRES in 
the 5' UTR such as in the encephalomyocanfitis virus, or it can 
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F^. 7. Translation tnHIatton In eukaryotes. Tho 4E-BPs are hypeiptios- 
phoryteted to release e I RE so that It can Interact wHh the 5* cap, and the 
er4F Initiation complex is aao^lM. The faiteracUon of po{y(A) binding 
protein wWi the Initiation oomplex and (^lartzation of the roRNA Is not 
depictBd bi the diagrarn. The aecondaiy structured 
the 40S litxjsofnal subunft b bound Id elF3, and the ternary complex 
consbling of elF2. QTP» and the Met-WNAare recniHed to the njRNA. The 
ribosonje scene the mRNA in a 5'-^3' dbecHon until an AUG start codon 
la found in the appropriate aequence context The initiation factors are 
released, and the lar^ Hbosomal subunit b reonitted. 



im^V attach to tKe IRES and then reach the inftiatbn codon by 
scanning or transfer, as Is the case wfth the polfovims (1). 

Regulation of Tmnski^ IrUOatlan 
TranslaHon Initiation fean be regulated by alterations in the 
expression or phosphorylation status of the various fact<M^ 
Involved. Key components in translatlonal regulation that 
may provide potential therapeutic targets tbilow, 

elF4& elF4E ptays a central role In translation regulation. 
It Is the least abundant of the Initiation factors and Is con- 
sidered the rate-limiting component for initiation of cap- 
dependent translation. eIRE may also be Involved in mRNA 
splidng, mRNA 3' processing, and mRNA nucleocytoplas- 
mlc transport (2). elF4E expression can be Increased at the 
transcriptional level In response to serum or growth fectors 
(3), eF4E overexpression may cause preferential translation 
of mRNAs containing excess secondary structure m th^r 
5' UTR ttiat are nonmally dtscriminalted a^nst by the trans- 



lational machinery and thus are inefficiently translated (4-7). 
As examples of thls» overfflcpresslon of eIF4E promotes In- 
creased translation of vascular endothelial grtawth lactor, 
flbrobiasl growth factor*2, and cy<ain D1 (2, 8. 9). 

Anoth^ mechanism of ctxiM is the regulation of elF4E 
phosphorylation. elF4E phosphorylation is mediated by the 
mitogen-activated protein Wnaso-interacting kinase 1 , which 
Is activated by the mitogen-acttvafted pathway actlva&ig 
extracellular s^nalHfelated Idnases and the stress-activated 
paftway acting through p38 mitogen-activated protein Id- 
nase (10'-13). Several mitogens, such as semm, platelet- 
derived growth factor, epldenmal growth factor, Insulin, 
angiotensin II, sn^ kinase overexpression, and nas over-* 
expression, lead to elF4E phosphorylation (14). The phos- 
phoiylation status of elF4E is usually conelated vtm the 
translatlonal rate and growth statue of the cell; however; 
elF4E pho^horylatlon has also been observed In response 
to some celMar stresses when transiatlonal rates actually 
decrease (15). Thus, further study Is needed to understand 
the effects of eiF4E phosphorylation on eIF4E activity. 

Another mechanism of regulation Is the alteration of elF4E 
avaUabl% by the binding of elF4E to the elF4E-bMing pro- 
teins (4E-BP, also known as PHA8-1). 4ErBPs compete vi^ 
e!F4G for a blnc^ site in eiF4& The binding of eiF4E to the 
best characterized eJF4E-bindlng protein, 4E-BP1, is regu- 
lated by 4E-BP1 phosphorylation. Hypophosphoryiated 4E- 
BP1 binds to eIF4E, whereas 4E-BP1 l^rperphosphorylatton 
decreases this binding. Insulin, angiotensin, epidama! 
growth factor, platelet-derived growth factor, hepato^te 
growth factor, nerve growth factor, Insulin-IBce growth factors 
I and B. InterleuWn 3, flrahulocyle-macrophage colony-stlm- 
ulatlng factor + steel factor, gastrin, and the adenovlrxis have 
all been reported to Induce phosphorylation of 4E-BP1 and 
to decrease the ability of 4E-BP1 to bind elF4E (15, 16). 
Conversely, deprivation of nutrients or growth fact<^ results 
in 4E-BP1 dephosphorylation, an increase In elF4E binding, 
and a decrease In cap-dependent translation. 

pTOSSKfnase. Phosphorylath>nofribo8oma!40Spn:ytein 
Se by S8K Is thought to play an Important role In translatlonal 
regulalton. S8K mouse embryonfc cells proliferate more 
slowly than do parental cells, demonstrating that S6K has a 
positive influence on cell proliferation (17). S6K regulates the 
translation of a group of mRNAs possessing a 5' terminal 
ollgopyr!nriidinetract^'10P)1bundatthe5' ^ 
piot^ rrtRNAs and other mRNAs coding for components of 
the transiatfonal madilnery. Phosphorylatton of S6K b regu- 
lated in p^ based on the avallab% of nutrients (18, 19) and is 
stimulated by several gnowth factors, such as platelet-derived 
gowth factor and Insuiln-llkB growth factor I {2(3). 

elF2a Phosphofytation. The binding of the Initiator tRIMA 
to the small ribosomal unit is mediated by Iranslatkm initia- 
tion factor elF2. Phosphorylatton of the a-subunit of elF2 
prevents fonnation of the elF2/GTP/Met-tRNA complex and 
Inhibits giobd protein synttiesfe (21, 22). elF2a Is phospho- 
rylated under a variety of oondltk}ns, such as vbal Infedkm, 
nutrient derivation, heme deprivation, and apoptosis 
elF2a Is phosphCHylated by heme-regutated frihibftor, nutrient- 
regulated protein kfne»e, and the IFN-lnduced, double^ 
stranded RNA-actlvated protein kinase (PKR; Ref. 23). 
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tbomTOR Signaling Pathway. The macrolide antft^otic 
mfnniyc^ (SiraUmus; wyath-Ayerst Research, CoSegeiville, 
RA) has been ihe subject of Intensive study because ft In- 
hibits signal transduction pathways involved in T*ce!l activa* 
tlon. The rapamydn-sensWve componwit of these pathways 
tSRiTDR (also caned l=RAP or RAFTI). mTOR Is the mam- 
maianhoniologue of the yeast TOR proteins that regulate 
progression and translation In response to nutrient availabil- 
ity mTOR Is a s^e-threonlne idnase that modulates 
translation Initiation by after! ng the phosphorylation status of 
4E-BPt and S6K (Fig. 2; Ref. 26). 

4E'BP1 IS ptiosphoiylated on multiple residues. mTOR phos- 
phoryiates the Thr-ar and Vfvr-Ae residues of 4E^P1 h yhro 
f?Sj; however, phoqshorylatlon at tiiese sttss Is not assodated 
wHh a loss of dF4E binding. Phosphorylation of Thi^7 ml 
Thr^ is required for subsequent phosphorylation at severed 
COOfftermlnal, s^rum-sensltive sftes; a comt^nation of these 
phosphocylati^ events appears to be needed to inhM the 
binding of 4&BP1 to dF4£ ^ The product of the ATM gene, 
p38/M^1 patiiway, and prot^ Idnase C<r also play a role In 
4E-BP1 phosphorylation ^7-29). 

S6Kand 4E-BP1 are also regulated. In part, by PI3K and Its 
downslream protein kinase Akt PTEN Is a phosphatase that 
negatively regulates PI3K signaling. PTEN nuO ceils have 
oonstttutively active of Akt, vi^ Increased S6K activfty and 
S6 phosphorylation (30). S6K activfty Is Inhibited botii by 
PI3K Inhlbftors worlmannln and LY294002 and by mTOR 
Inhibitor rapamycin (24). Akt phosphoiylates Ser-2448 fai 
mTOR in WbiOt and this site Is phosphorylated upon Akt 
aclivaibn In vfvo (31^). Thus. mTOR Is regulated b/ the 
P^K/Akt pathway; however, this does riot appear to be the 
only mode of regulation of mTOR activfty. Whettier ttie PI3K 
patiiwsy also regulates S6K and 4E-BP1 phosphcnylatbn 
independent of mTOR is conbroverslal. 

Interestingly, mTOR autophosphoryiation Is blocked by wort- 
mannin but not by rsparr^ Ihls seeming Inoonstete^ 
suggeststiiat mTOR-responslve regulation of 4E-«P1 and S6K 
activity occurs through a mechanism other than Intrinsic mTOR 
kinase activtly. An altematepatiiway for 4E-BP1 and 86K phos- 
phorylation mTOR acti\% is by ttie Inhibition of a phospha- 
tase. Tieatmerrt wftfi caiyculln A, an inhibitor of pho^hatases 1 
and 2A, reduces rapamydrt-induced dephosphoiylation of 4&- 
BP1 and S6K by rapamydn (3S). PF^ interacts wltti fuB-lengtti 
S6K but not wfth a 56K mutant tiiat » resistant to dephospho- 
rylatfon resulting from rapamydn. mTOR phosphorytates PI^ 
In v*o; however, how tills process alters PP2A activity is not 
known. These results are consistent wfth tiie nKxiei that phos- 
phorylation of a phosphatase by mTOR prevents dephospho- 
rylation of 4E-BP1 and S6K, and conversety, tiiat nutrient d^ 
rivattonand rapamydn bk)ck inhibition of the phosphatase by 
mTOR. 

Potyadenylation. The poly(A) tall in eukaryotic mRNA Is 
ir>ipoitant In enhancing translation Inftlatlon and mRNA sta- 
bility. Polyadenylation plays a key R>ie In regulating gene 
exfTOsslon during oogenesis and earty embryogenesls. 
Some mRNA that are translationalty inactive In the oocyte are 
polyadenytated conconrUtantiy wfth translational activation In 
oocyte maturatton, whereas other mRNAs that are transla- 
tlonatiy active during oogenesis are deadenylated and trans- 
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I Tkanslation of STOP mRNAs 

OB^Hiqicpdcat TiCRiwlai ion 

(mRNAs wllh Idg^y siroctued S* mits) 

Fig. 2. Regutation of translation Initiation by signal imnsduction path- 
ways. Signaling via p3S. extracoiliilar signakelaiad ktna&a^ PtSIC and 
mTOR can aB acttvata translation tnitfatSon. 



lationally silenced ^-^). Thus, control of pdy^ tall syn* 
thesis Is an Important regulatory step In gene expression. 
Hie 5' cap and poly(^ taH are thougtA to funcQon syneri^- 
tlcally to regulate mRNA translational efficiency (39, 40). 

RNA Packaging. Most RNA-t)lndIng proteins are assem- 
bled on a transcript at the time of transcription, thus deter- 
mining the translational fete of the transcript (41). A highly 
consented family of Y-box proteins Is found In cytoplasmic 
messenger ribonucleoprotein particles, where the proteins 
are thought to pl^ a rde In restricting the recruitment of 
mRNA to the transMonai machine (41-43). The major 
mRNA-assoclated protein, YB-1, destabllfees the Interaction 
of elRE and the 5' mRNA cap in vrtro, and overexpresslon of 
YB-1 results In translational repression In vivo (44). Thus, 
alterations In RNA packaging can also play an important role 
In translatlonsd regulation. 

Translation Alterations Encountered In Cancer 

Three main alterations at the translational level occur In cancen 
variations In mRNA sequences that increase or decrease trans- 
latlbnal effldmcy, changes In the expies^n or availabirrty of 
compon^ of the translational machinery, and activation of 
translation thn^ugh at>6nantiy activated slgn^ trensducto 
pathvrays. The first alteratJon affects the toanslation of an Indl- 
vidua! mRNA that may play a role In carcinogenesis. The sec- 
ond and third alterations can lead to more gbtjal changes, such 
as an Increase In the overaO rate of protein ^thesis, and the 
translational activation of several mRNA species. 

VarfaUons ht nMiA Seguefice 
Variations in mBNfii sequence affect the translation^ efR- 
dencyofthetranscripLAbrief description of these variatkms 
and exami^ of each mechanism follow. 

Mutations. Mutations in the mRNA sequence, especially 
in the 5' UTR. can alter Its ftanslatlonal efficiency, as seen In 
the following examples. 
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o-inyc Safto et a/, proposed that translation of MMength 
^myo Is repfdssed, whereas in several Burtdtt lymphomas 
^havedd^onsof thernWAS' irTRtiansiallonGf 
is more efficient (45). Mm recentiy, it was reported that tf)e 
5' UTR of c-myc contains IRES, and thus c-myc transla- 
tion can be initiated by a cap-Independent as well as a 
cap-d^)endent mechanism (46, 47). In patients wWi muWple 
myeloma, a G-*T mutation In the omyc IRES was identified 
(48) and found to cause an enhanced initiation of translation 
via internal rit>o$oma] entry (49). 

. BBCAU A somatic point mutation (117 Qr^O) fri position 
-3 vt^ respect to the start codon of the BAC47 gene was 
Identifled in a highly ag^esslve sporadic breast cancer (50). 
Chimeric cor^tnicls consisting of the wild-type or mutated 
BRCA1 UTR and a downsbeam luciferase repoiter dem- 
oristratedadecreaaa in the tiansiatk}n£d efficient 
UTR mutation. 

pycfip}-cfepen(«snf KAiaselrt^ Some inherited 

melanoma IdrKlreds have a G-^T transyersion at base -34 
of cydfn^lependent kinase lnhibitor-2A, whirti encodes a 
cydin-dependent kinase 4/cyclln-dependent kinase 6 Wna^ 
Inhibitor tmporlant In Gt checkpoint regulation (51), This 
mutation^gives rise to a novel AUG translation Initiation 
QOdon, creating an upstream open reading frame that ccmi- 
petes Ibr scanning rit>osomes and decreases translation 
from the wild-type AUa 

Altemate SpOdng and Allemato Transcription Start 
Sites. AlteiHfions in splteing and altemate trmnscriptton sites 
can lead lo variations In 5' im^ sequence, lengthy and se^^ 
tty sinjcturdb ultimately impading translalSorad effldency. 

ATM, The ATM gene tias four nonooding axons in its 5' 
UTR that undergo extensive alternative splldng (52). The 
contents of 12 different 5' UTRs that show considerat^e 
diversity In length and sequence have been identified. These 
divergent 6' leader sequences play an Important role In the 
transtafional regulation of the ATM gene. 

mdm, in a suk>set of tumorSiOversxpresslon of the onco- 
protein nKlm2 results In ^fihanoed transiatbn of the mdm2 
mRMA. Use of different proinolers leads to two md7i2 tran- 
scripts that differ «rily In their 5' leaders (53). The tonger 6' 
imi contains two upstream open reading frames^ and this 
mRNA is loaded vm\ ribosomes ineffidently compared with 
the short 5' UTR. 

ARCAf. In a normal mammary ^and, WGA^ mRNA is 
repressed wnth a shorter leader sequence (5'UrRa), whereas 
^ sporadic breast cancer tissue, BBCA1 mRNA Is expressed 
with a longer leader sequence (5' ITTRb); the translatlonal 
efRdency of transcripts containing 5' UTRb Is 1 0 times lower 
than that of transcripts contalru'ng 5' UTRa (54). 

TQF-ta^ 7GF-03mRhlAhdudesa1.14di5'UTawhlch 
exerts anlnhibitoryeff^ on tanslation. Many human breast 
cancer oeB Rnes contain a novd TGF-p3 trai^cript with a 5' 
UTR that Is 870 nudeoUdes shorter and has a 7-lbI d greater 
transtetlkwal efWdency than the nonmaJ TGF^pS mRNA p5). 

Altemate Polyadenylafion Sitea. Multiple polyadenyl- 
ation signals leading to the generatfon of severed transcripts 
with diflering 3' UTR have beex\ described for several mRNA 
spedes, such as the RET proto-oncogene (5Q, ATM gene 
(62), tissue InhlbHor of metaliopr0telnase&^ (67). RHOA 



pmto-oncogene {58), and calmoduiln-l {58). Although the 
effect of these altemate 3' UTRs on translation Is not yet 
known, they may be important in RNA-pmteln Interactions 
that affect translational recruitment The role of these alter- 
ations in cancer development and progression Is unknown. 



^Itenaffons In the Components of the 
Tw^btkm B^iMnmy 

AHeraHons in ttie components of translajfon machinery can 
talce many fbnns. 

Ovmxpressslon of eiF4E Overacpression of elF4E 
causes malignant transfbnnation in rodent cells (60) and the 
dwegulation of HeLa cdl growth (61). Polunovsl<y ef aH (62) 
found that elF4E oversxpres^on substitutes for serum and 
indh/idual ^wth factors In preserving vlat^ity of fSkrobiasis, 

whidi suggests that elF4E can mecSate both proHferatlve and 
survhal signaling. 

Qevated levels of e!F4E mRNA have been found In a broad 
spectnjm of transformed ceU lines (63). elF^ levels aie 
elevated in all ductal cardnoma in sAu spedmens and Inva- 
sive chidal carcinomas, compared with benign breast spec* 
imens evaluated wltti Western btot analysis (84, 6^. Prelim- 
inary studies suggest that tills overexpreseion is attributable 
to gene amplification (66). 

There am accunridafing data suggesting that elF4E overex- 
pression can be valuable as a prognostic marioer. elF4Eover- 
exfveeslon wasfbund In a ratrospecQwe stiKl^ to be amariier of 
poor prDgnosls In stages i to Ul braast caR^U3nia(^Verl^ 
tibn of the prognostic value of elF4E m bi^ cancer is now 
under way in a prospective trial (67). However, In a different 
study, elF4E repression was oonn^ed with the aggressive 
behavior of non-HodgWn's lymphonrKS (6^. In a prospective 
analysis of paints with head and neck cancer, elevated levels 
d elF4E in histologically tumor-fiee suiglcal margins predicted 
a signfficantiy tncreased risk of k)CdHnBgk)nal recunenoe (d). 
These results Oi suggest that elF4E overexpressicHi can be 
used to sded pafients nrrfght beneft fh)m nrore aggres^^ 
systemic ther^. Fufthornae, the head and neck cancer data 
suggest that elF4E ovensxpresslon is a fieki defect and can be 
used to guide kical therapy. 

Alterations In Ottier Initiation Actors. Altmtions In a 
number of other irutlation factors have been associated with 
cancer. Overprodudton of dF4G, similar to eiF4E. leads to 
malignant transfonmatlon !n vitro (6S). eIF-2a is found in 
increased levels In tnonchioloah/eolar cardnomas of the lung 
(3). InltlaBon ^or elF-4A1 is overexpressed In melanoma 
(70) and hepatocellular carcinoma (71). The p40 subunit d 
translation initiation fader 3 Is amplified and overexpressed 
in breasland prostate cancer (72), and the elF3-p1 10 subunit 
Is overexpressed In testicular seminoma (73). The rde that 
overexpresslon of these initiation fadors plays on the devel- 
o]ment and (Nx>grBss!on of cancer, if any, is not known. 

Overexpresslon of S6K. S6K Is amplified and highly 
overexpressed in the MC)F7 t»east cancer cdl line, com- 
pared with nomnal mammary er^thdium (7^. bi a study by 
Bartund ef af. (74), S6K amnplified in 59 of 668 primary 
breast tumors, and a statteticaf ly significant assodatlon was 
obsenred between amplification and poor prognosis. 
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Owracitfesslon of PAP. PAP catalyzes 3' poly{A) syn- 
thesisi PAP is Gveraxpressed in hionan cancer cells com* 
pared with normal and viral^ transtonned cells (7€). PAP 
enzymaAlc activity in breast tun^ 
PAP protein levels (7Q and, In mammary tumor cytosols, was 
found to be an Independentfactor for predicting suralval (76). 
Utile is known, howeva^, about how PAP expression or bd^ 
ftvtty affects the translatlona] piofiie. 

AltaralfonslnRNArblmllngProlelne. Even less is known 
about atterations b RNA packaging In cancer. Increased ex- 
pressk>n and nuclear tocallzatkx) of the im4)lndlng pnsteln 

YB-1 ars Incflcators of a poor prognosis for breast cancer (TT), 
noTMnroO cea lung cancer (78), and ovarian cancer (7^ 
ever, this efftet may be mediated at least In part at the level of 
tianscrlpttoa because YB-1 increases chemoresistance by en- 
hancing the transcriptk)n of a multidrug resistance gen^ 



Actii/aUon of Signal Transduce Pathways 
ActlvaHon of s^nal transducSon pathways by bss of tumor 
auppiessor genes or ovei«xpressk>n of certain tyrosine kinases 
can OMfiblbute to the growth and ag^essiveness of tuiriors. An 
lnnpoitan|,.mutant b human cancers Is me tumor suppressor 
genePIE^r, wtMi leadstD the actKffidfcm of^ie PI3^ 
vvay* Acttvalion cf P»K and Akt Mines the oncogerUc trans- 
fomiaSon of (^dctonen^ fibroblasts. The iransf^^ 
showconsatutlvephosphorylallonofS6Kandof4&BP1 
A mulant Akt that retains kinase actMty but does not (^los- 
phQiylateS6Kor4&BP1 does not transfonm fibroblasts, whteh 
smgTBsts a correlatk>n between the onoogertfci^ of PI3K and 
Akt and the phosphorylatkm of S6K and 4&eP1 (B1). 

SfiMeral tyrosine kinases such as plateieMerived growth 
factor, Insulln-Uke growth fatitor, HER2/heu, and epldennal 
growV) factor receptor are overexpressed in cancer. Be- 
cause these kinases acth^ downstream si^ial transduc- 
tion pathvvays known to alter translation initlatk)n, activa^ 
of translation Is tlkely to contn'bute to the growth and aggrBS- 
slveness of these tumors. RirUiermore, the mFVIA for many 
of these kinases themselves are under translatlonal control. 
I=br example, HER2/neu mRNA is translattonally controlied 
both by a short upstream open reading frame that represses 
HHR2/heu translation In a cell type-Independent manner and 
by a distinct cell type-dependent mechanism that Incraasas 
tFanslatk>nal efllclency (82). HER2/neu translation Is different 
In transfdmned and normal ceils. Itius, it is possible that 
^Jterattons at the translatlonal level can in part account for 
the discrepancy t>etween HEfi2/neu gene amplification de- 
tected by fluorescence /nsftu hybridization and protein levels 
detected tiy immunohistochemlcal assays. 



Translation Targets of Selected Cancer Therapy 
Components of the translatton machinery and signal path- 
ways frwolved In the activation of translatoi Inltlatlcm napi^ 
sent good targets for cancer therapy. 

Targotbtg tha mTOR Sffnalbtg Mbwa^ JR^pssm^ 
andTim^ttn 

Rapamycin Inhibits the prbliferstkNi of lymphocytes. It was 
InrUaly devetoped as an imnrainosuppresstve dnig for organ 



transplantation. Rapamycin with FKBP 12 (FK506-bkidIng 
protein, M, 12,000) binds to mlDR to Inhibit its lunctton. 

Rapamydn cmjses a smaU but significant reduction in the 
inmation rate of protein synthesis (83). It blocks cell growth in 
part by bk)cking S8 phosphorylation and selectively sup- 
pressing the translatton of 5' TOP mRNAs, such as ribosomal 
proteins, and ek>nga!aon fectore (B^-BQ. Rapamydn also 
bkKks 4E-BP1 phosphorylatfon and Inhibits cap-dependent 
but not cap-independent translation (17, 86). 

The rapamydn-^ensKve signal transduction pafrivyray, acti- 
vated during maOgnanttnansfonnatkDn and cancer progression, 
is now b^g stiKfied as a taiget for cancor therapy (87). Pn:>$- 
tafe, breast, sinaO ceil iung» gitoblastorna, melanoma^ andTH)en 
leukentia are among me cancer Ones most sensitive to the 
rapaniydn anatogue CCi-779 (Wyeth-Ayerst Research; Ref. 
87). In rhabdomycosarcoina cell Knes, rapamycin is either cyto- 
statte or cytockid, depending on the p53 status of the ceO; p53 
wQd-type ceRs treated wHh rapamydn arrest In me phase 
and maintain thelr.vlabinty, whereas p53 mutant cells accumu- 
late InQ, and undeigoapoptosis^ 89). In arecontiy reported 
stu<^ using human prinrtltive neuroeotodemnal tumor and 
medullobtastoma models, rapamycfri exhibited ntora cytotox- 
k% In cornblnalion wim cteplatin and camptothecin than as a 
single agent fri Ww, ca-7TO deteyed gncwth of xenogr^ 
16096 afterl weekof therapy and24Q%afler2we^AsIn0e 
hlgh^tosa administration caused a 37% decrease in tumor 
voUima Growth InNbltkm £ri vfiA9 was 
dspiatin in oornbtatkm wim CX:i-779 than with ds^ 
(9G). thus, precilnkal studies suggest mat raparnycfri ana-* 
togues are useAjl as single agents and in comt^haitton with 
^lemotherapy. 

Rapamycin analogues CCi-779 and RAD001 (Novartls, 
Basel, Switzerland) are now In dlnk^l trials. Because of the 
known effect oif rapamycin on lymphocyte prollferatton, a 
potential pnH>lem wWi rapamycin anedogues is Immunosup- 
pression. However, although prokxiged ftnmunosuppressten 
can result from rEy>amycln and 00*779 administered on 
continuous-dose schedules, the Immunosuppressive effects 
of rapamydn analogues resolve In -24 h after therapy 
(91). The principal toxteltiss of CCI-779 have Included der- 
matologlcal toxicity, myektsupprsssbn, Infectk^n, mucositis, 
dianhea, reversible elevations In liver function tests, hyper- 
glycemia, hypokalemia, hypocalcemia, and depression (87, 
92-94). Phase 11 trials of CCI-779 have been conducted In 
advanced renal cell carcinoma and in stage 111/IV breast 
carcinoma patients who failed with prior chemotherapy. In 
the results reported In abstract fomn, although there were no 
complete responses, partial responses virere docunwnted In 
both renal cell carcinoma and In breast carcinoma {94, 95). 
Thus. CCi-779 has documented preliminary clinical activity In 
a previously treated, unselected paUent populatton. 

Active investigation Is under way into patient selection for 
mTOR InhteRors. Several studies have found an enhanced 
efficacy of CCI-779 in PTB4-nuH tumors C30, 96). Another 
study found ttiat six of eight breast cancer cell lines wers 
responsive to CCI-779, although only two of these lines 
laci^ PTEN (97) There was, however, a positive conelotlon 
between Akt activation and CCI-779 sensftMty (97). This 
correlation suggests that activation of the PI3K-Akt pathway. 
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negardless of wfielher tt Is attributable to a PTEN mutatkm or 
^ overaxpresslon of receptor tyrosine kinases, mates can- 
cer celt amenable to mTOR-dfrected therapy. In contrast, 
lower levels of the tcofget of mTDR, 4E-&P1, are associated 
with ropainycln resistance; thus, a lower 4E-BP1/elF4E ratio 
msQf (sredlct rapamydn resistance (98). 

Another mode of activity for rapannydn and hs analogues 
appeaiB to be through Inhibition of anglogenesls. Tl^ actlv- 
ily may be both through direct InhlbitlQn of endotheBal cell 
proiifefation as a result of mTOn Inhibition in these c^ls or by 
inhibition of translation of such proangiogenic factors as 
vascular endothelial growth factor In tumor cells (99, 100). 

The anglogenesis Inhibitor turr^tln, another anticancer 
(Hig currently undier shidy, was also found recently to Inhibit 
faanslatkm In endothellai cells (101). Through a requisite in- 
teraction with integrin, tumstaUn Inhibits activation of the 
PI3K/Akt pathway and mTOR In endothelial cells and pre- 
vents dissociation of elF4E from 4E-BP1, hereby Inhibiting 
cap-dependent translation. These findings suggest that en- 
dothelial cells are especially sensitive to therapies targetfhg 
tile mlDR-slgnallng pathway. 

Tarjgeitltg df^cc EPA^ CkOfUnamh, nuUt-T, 
Bnd Flavonoids 

EPA is an n^ polyunsaturated fEdty acid found In the fisti- 
based diets of populations havfrig a low inddenoe of cancer 
(102!). EPA Inhibits the proHfenation of cancer cells (103), as 
Weil as In animal models (104. 105). It blocks ceil division by 
inhibiting translalion tnitiation (105). EPA releases Ca^*^ firom 
Intracellular stores v^lle Inhibiting their refining, thereby ac- 
tivating PKR. PKR, in turn phosphorylates and inhltHts elF2a, 
resulting in the inhiblOon of protein synthe^ at the lev^ 
translation Ihttiatlon. Similarly, clotrimazole, a potent antipio- 
liferatlve agent (n vitro and !n vfvo> inhibits cell growth through 
depletion of Ca'^^ stores, activation of PKR, and phospho- 
rylation of elF2a (106). Consequently, clotrimazole preferen* 
tially decreases the expression of cycilns A, E, and D1. 
resulting in blockage of the cell ^e In 

mcto-r is a novel tumor auppreesor gene t>^ng developed 
as a gene therapy agent. Adenoviral transfer of mda-7 ^ 
mdaT) Induces apoptosls In many cancer cells Including 
breast, colorectal, and lung cancer (107-109). Ad-mda7 also 
Induces and activates PKR, virhlch leads to phosphorylation 
of elF2a and Induction of apoptosis (110). 

Fiavonolds such as genistein and quemetin suppress tu- 
mor oen growth. Ail three mammalian eli=2a kinases, PKR. 
hQme-rsgulatad mhibltor, and PERK/PEK, are activated by 
fiavcmoids, with (^losphorylation of elF2a and Inhibition of 
protein synthesis (111). 



Targemg dF4A and 9IF4B AntisMSB RNA 
and Peptides 

Antisense e)qpres3ion of elF4A decreases the proDfMlon ^ 
of melanomaoelis C1 1^. Sequestiatton of elF4E by overa)qpr^ 
8ionof4E-BP1 is proapoptotic and decreases tumorlgttftfcity 
(113, 114). Reducitan of elF4E wtth antisense RNA decreases 
soft agar growth, increases tumor latency, and Increases tiie 
rates of tumor dout^ng times (7). Antisense elF4E RMA treat- 



inent dso recbices the expiesslbn of angiogenic factors (115) 
andhasbeen proposed asapot^ittialadiuvanttherar^fbr head 
and neck cancers, particutaly when elevated elF4E Is found In 
surgical maiglns. Small molecule inhibitors that bind tiie ^F4G^ 
4E-BP1 -binding domain of efF4E are proapoptotic (116) and 
are also being actively pursued. 



BxpMlkig SelecHm TranslaHon for Gem Therapy 

A diffmnt therapeutic apptx)ach tiiat takes advantage of tiie 
enhanced cap-dependent translaticm In cancer ceils is tiie use 
of gene tiierapy vectors encoding siridde genes wttii highly 
stfiictiffed 5' IJTRThese mRNA would tius be at aoompetitive 
disadvant^ in nonnal cells and not translate well, whereas in 
cancer oells» tii^ wouU transiata more efficiently. For example, 
the inboductibn of the 5' um of fibroblast gnaw^ 
ttie coding sequence of /)efpessi^feorW^5type-f ihyaikSne 
Mhase gene» aBows for selective translation of />Gfpes s&r^^ 
vfrus thymidine kinase gene In breast cancer cefl lines 
compared with normal mammary cefl Bnes and resuHs In se- 
lective sensiti^ to ganddovbraiT). 



To ward the Future 

Translation is a cru(^ process N> every cdL However, several 
alterations in tnonslational obiitroi occur in ccui^ 
appear to need an abenantiy activated transiationd state for 
siavival, tiius flowing the targeting of translation Initiation wtth 
surprisingly low to)dctty. Components of the translaOonal ma- 
chineiy, such as elF4£, and signal transduction pcdhways In- 
volved in translation Initiation, sinh mTOR, represent promising 
targets for canoer^eiapy. InhR^tors of the mTOR have afaeady 
shown some preliminary activi^ In dMcal trials. It Is posslkrie 
that Witt! the devetopment cf better predtetive marloem and 
better patient ejection, r^ponse rates to single-agertt therapy 
can be Improved. Similar to ottier cytostatic agerrts, howrever, 
mTOR inhibitors are most filcely to achlevia dlnlcal utility In 
combination tiierapy. In the Interim, our Increasing understand- 
ing of translation initiation and signal transduction pathways 
promise to lead to tiie Identification of new tiierapeutic targete 
in Vie near liiture. 
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